Source Code: Making PDFs Searchable with Azure AI
Built this to make it easier for devs to experiment and build real-world AI workflows.
A simple ASP.NET Core Web App built with .NET 8.0 that allows users to:
📂 Upload PDF documents
🧠 Index the content using Azure AI Document Intelligence
🔍 Perform full-text searches across uploaded documents using Azure Cognitive Search
🚀 Features
Clean web interface for uploading and managing PDFs
Automatic extraction of text using OCR
Indexed and searchable content via Azure
Full-text query support for retrieving relevant documents
🛠️ Tech Stack
ASP.NET Core (.NET 8.0)
Azure AI Document Intelligence
Azure Cognitive Search
Razor Pages / MVC
📦 Getting Started
Clone the repository:
Configure Azure services (Document Intelligence & Cognitive Search) in Azure Portal: https://blog.habeebyakubu.com/p/tutorial-converting-pdfs-into-searchable
Add your keys and endpoint URLs in
appsettings.json
Run the app:
dotnet run
💡 Ideas for Enhancement
Add authentication (e.g. Azure AD, Identity)
Filter or categorize search results
UI improvements (responsive layout, loading indicators, etc.)
Logging and error handling
🛠 GitHub repo here: dotnetyatch/PdfSearchApp
Happy to get feedback or contributions from the community!
That’s all for today, see you in the next one.