Job Title:
Cutting-Edge Multimodal AI Research Opportunities
Company: beBeeMultimodal
Location: Amravati, Maharashtra
Created: 2025-10-16
Job Type: Full Time
Job Description:
Job Title: Multimodal AI Research Intern We are seeking exceptional AI/ML research interns to work on cutting-edge multimodal models and agent-driven systems. This role is designed for individuals who thrive in fast-paced environments, stay ahead of AI advancements, and can deliver real value with minimal guidance. You will be working on building, fine-tuning, and evaluating multimodal LLMs (text, image, video, audio) using PyTorch and Hugging Face. You will design autonomous AI agents and workflows using LangChain, AutoGen, CrewAI, LlamaIndex, Haystack, etc. You will implement retrieval-augmented generation (RAG) pipelines with vector databases (Pinecone, Weaviate, Faiss). You will develop and deploy production-grade APIs (Flask/FastAPI, Docker, GitHub Actions, CI/CD). You will conduct independent research on cutting-edge models, frameworks, and tools – then apply findings to real-world use cases. Required Skills and Qualifications: Strong foundations in deep learning and machine learning (CNN, RNN, LSTM, Transformers, Attention). Exposure to multimodal models (e.g., GPT-4o, Claude, Gemini, LLaVA, Kosmos-2). Excellent Python skills (NumPy, Pandas, PySpark). Experience with APIs, Flask/FastAPI, Docker, GitHub Actions. Familiarity with cloud platforms (AWS/GCP/Azure) is a plus. Self-motivated, independent researcher who thrives on solving problems without step-by-step guidance. Benefits: Work on cutting-edge AI projects in multimodality and agent-driven systems. Mentorship from AI practitioners solving real-world challenges. Flexible but CST-aligned hours. A chance to publish and showcase work (papers, repos, demos). Conversion to paid role/full-time opportunity for outstanding contributors.