π’ Beginner Projects
Perfect for getting started with AI engineering. These projects focus on single components and straightforward implementations.
#### OCR & Vision
- [LaTeX OCR with Llama](./LaTeX-OCR-with-Llama) - Convert LaTeX equation images to code using Llama 3.2 vision
- [Llama OCR](./llama-ocr) - 100% local OCR app with Llama 3.2 and Streamlit
- [Gemma-3 OCR](./gemma3-ocr) - Local OCR with structured text extraction using Gemma-3
- [Qwen 2.5 OCR](./qwen-2.5VL-ocr) - Text extraction using Qwen 2.5 VL model
#### Chat Interfaces & UI
- [Local ChatGPT with DeepSeek](./local-chatgpt%20with%20DeepSeek) - Mini-ChatGPT with DeepSeek-R1 and Chainlit
- [Local ChatGPT with Llama](./local-chatgpt) - ChatGPT clone using Llama 3.2 vision
- [Local ChatGPT with Gemma 3](./local-chatgpt%20with%20Gemma%203) - Local chat interface with Gemma 3
- [DeepSeek Thinking UI](./deepseek-thinking-ui) - ChatGPT with visible reasoning using DeepSeek-R1
- [Qwen3 Thinking UI](./qwen3-thinking-ui) - Thinking UI with Qwen3:4B and Streamlit
- [GPT-OSS Thinking UI](./gpt-oss-thinking-ui) - GPT-OSS with reasoning visualization
- [Streaming AI Chatbot](./streaming-ai-chatbot) - Real-time AI streaming with Motia framework
#### Basic RAG
- [Simple RAG Workflow](./simple-rag-workflow) - Basic RAG with LlamaIndex and Ollama
- [Document Chat RAG](./document-chat-rag) - Chat with documents using Llama 3.3
- [Fastest RAG Stack](./fastest-rag-stack) - Fast RAG with SambaNova, LlamaIndex, and Qdrant
- [GitHub RAG](./github-rag) - Chat with GitHub repos locally
- [ModernBERT RAG](./modernbert-rag) - RAG with ModernBert embeddings
- [Llama 4 RAG](./llama-4-rag) - RAG powered by Meta's Llama 4
#### Multimodal & Media
- [Image Generation with Janus-Pro](./imagegen-janus-pro) - Local image generation with DeepSeek Janus-pro 7B
- [Video RAG with Gemini](./video-rag-gemini) - Chat with videos using Gemini AI
#### Other Tools
- [Website to API with FireCrawl](./Website-to-API-with-FireCrawl) - Convert websites to APIs
- [AI News Generator](./ai_news_generator) - News generation with CrewAI and Cohere
- [Siamese Network](./siamese-network) - Digit similarity detection on MNIST
---
π‘ Intermediate Projects
Multi-component systems, agentic workflows, and advanced features for experienced practitioners.
#### AI Agents & Workflows
- [YouTube Trend Analysis](./Youtube-trend-analysis) - Analyze YouTube trends with CrewAI and BrightData
- [AutoGen Stock Analyst](./autogen-stock-analyst) - Advanced analyst with Microsoft AutoGen
- [Agentic RAG](./agentic_rag) - RAG with document search and web fallback
- [Agentic RAG with DeepSeek](./agentic_rag_deepseek) - Enterprise agentic RAG with GroundX
- [Book Writer Flow](./book-writer-flow) - Automated book writing with CrewAI
- [Content Planner Flow](./content_planner_flow) - Content workflow with CrewAI Flow
- [Brand Monitoring](./brand-monitoring) - Automated brand monitoring system
- [Hotel Booking Crew](./hotel-booking-crew) - Multi-agent hotel booking with DeepSeek-R1
- [Deploy Agentic RAG](./deploy-agentic-rag) - Private Agentic RAG API with LitServe
- [Zep Memory Assistant](./zep-memory-assistant) - AI Agent with human-like memory
- [Agent with MCP Memory](./agent-with-mcp-memory) - Agents with Graphiti memory and Opik
- [ACP Code](./acp-code) - Agent Communication Protocol demo
- [Motia Content Creation](./motia-content-creation) - Social media automation workflow
#### Voice & Audio
- [Real-time Voice Bot](./real-time-voicebot) - Conversational travel guide with AssemblyAI
- [RAG Voice Agent](./rag-voice-agent) - Real-time RAG Voice Agent with Cartesia
- [Chat with Audios](./chat-with-audios) - RAG over audio files
- [Audio Analysis Toolkit](./audio-analysis-toolkit) - Audio analysis with AssemblyAI
- [Multilingual Meeting Notes](./multilingual-meeting-notes-generator) - Auto meeting notes with language detection
#### Advanced RAG
- [RAG with Dockling](./rag-with-dockling) - RAG over Excel with IBM's Docling
- [Trustworthy RAG](./trustworthy-rag) - RAG over complex docs with TLM
- [Fastest RAG with Milvus and Groq](./fastest-rag-milvus-groq) - Sub-15ms retrieval latency
- [Chat with Code](./chat-with-code) - Chat with code using Qwen3-Coder
- [RAG SQL Router](./rag-sql-router) - Agent with RAG and SQL routing
#### Multimodal
- [DeepSeek Multimodal RAG](./deepseek-multimodal-RAG) - MultiModal RAG with DeepSeek-Janus-Pro
- [ColiVara Website RAG](./Colivara-deepseek-website-RAG) - MultiModal RAG for websites
- [Multimodal RAG with AssemblyAI](./multimodal-rag-assemblyai) - Audio + vector database + CrewAI
#### MCP (Model Context Protocol)
- [Cursor Linkup MCP](./cursor_linkup_mcp) - Custom MCP with deep web search
- [EyeLevel MCP RAG](./eyelevel-mcp-rag) - MCP for RAG over complex docs
- [LlamaIndex MCP](./llamaindex-mcp) - Local MCP client with LlamaIndex
- [MCP Agentic RAG](./mcp-agentic-rag) - MCP-powered Agentic RAG for Cursor
- [MCP Agentic RAG Firecrawl](./mcp-agentic-rag-firecrawl) - Agentic RAG with Firecrawl
- [MCP Video RAG](./mcp-video-rag) - Video RAG using Ragie via MCP
- [MCP Voice Agent](./mcp-voice-agent) - Voice agent with Firecrawl and Supabase
- [SDV MCP](./sdv-mcp) - Synthetic Data Vault orchestration
- [KitOps MCP](./kitops-mcp) - ML model management with KitOps
- [Stagehand Γ MCP-Use](./stagehand%20x%20mcp