Job Title:
NLP Engineer
Company: Devnagri AI
Location: New delhi, Delhi
Created: 2026-01-14
Job Type: Full Time
Job Description:
Job Description: Senior NLP / AI Engineer – DevnagriRole: NLP/AI Engineer Experience: 3+ years Job Type: Full-TimeMode: HybridLocation: BGR Tower, Sector 16A, FilmCity, NoidaAbout the RoleDevnagri is seeking an experienced Senior NLP/AI Engineer to lead development of cutting-edge AI systems powering translation, transliteration, ASR, OCR, LLM, and RAG products serving enterprise and government clients. This role requires expertise in building, optimizing, and deploying production-grade NLP and speech models at scale, along with strong infrastructure, model engineering, and cross-functional collaboration skills. You will be responsible for designing innovative AI solutions, training and optimizing models, improving system performance, and ensuring high availability of AI services used across Devnagri’s platform, chatbot systems, and enterprise AI deployments.Key Responsibilities1. AI Model Development & TrainingTrain, finetune, and deploy models across multiple domains:Multilingual Neural Machine Translation (NMT), Adaptive Translation SystemsMultilingual Transliteration models (Indian languages)Speech-to-Text (ASR / Whisper / Nvidia Nemo / Indic-ASR)Text-to-Speech (TTS)Large Language Models (LLMs)Embedding models for RAGBuild multilingual models supporting 20+ Indian languages.Perform dataset creation, preprocessing, augmentation, and large-scale training.Conduct model benchmarking using chrf++, BLEU, WER, CER, and custom evaluation metrics.Convert models to optimized inference formats (CTranslate2, Faster-Whisper, AWQ/INT4/INT8 quant).2. Model Optimization for ProductionReduce model sizes through quantization and pruning.Optimise inference speed improvements for real-time workloads.Optimize GPU/CPU utilization and memory footprint for large models.Build scalable inference pipelines for translation, ASR, and RAG.3. Audio & Video Processing SystemsDevelop advanced audio transcription and translation pipelines.Implement real-time STT systems for indic languages.Build video subtitle extraction and SRT translation workflows.Integrate diarization, language detection, summarization, and cross-lingual translation.4. RAG & LLM-Based SystemsArchitect multilingual Retrieval-Augmented Generation (RAG) pipelines.Build vector databases and embedding models.Implement document indexing, chunking, parsing, and hybrid retrieval search.Integrate LLMs (Llama, Gemma, Qwen etc.) for chatbot and voice-bot systems.5. Infrastructure & Server ManagementManage AI/ML servers on AWS & GCP (GPU VM provisioning, optimization).Reduce infra cost by optimizing GPU usage, scheduling, and server consolidation.Implement auto-restart, monitoring, logging, and fail-safe mechanisms for all AI services.Deploy high-availability APIs for translation, transliteration, ASR, OCR, and chatbots.Familiarity with cloud-based GPU environments and troubleshooting (NVIDIA drivers).6. Cross-Functional OwnershipWork with Sales, Ops, Tech teams to troubleshoot, support clients, and deliver large projects.Maintain detailed documentation for product flows, APIs, model deployments.Handle urgent escalations, server crashes, and mission-critical deployments.Create internal tools and FAQs to reduce dependency on the AI team.Required Skills & ExperienceTechnical SkillsStrong background in NLP, Speech, Deep Learning, and Generative AI.Experience: 4-5 years in production ML/NLP systemsHands-on experience with:Python, PyTorch, TensorFlowSpeech to text and Text to speech models, open source LLMs, Transformer architecturesCTranslate2, Faster-Whisper, ONNX RuntimeLLM inference frameworks like, vLLM, Sglang, LLM quantization techniquesVector DBs (FAISS, Pinecone)Docker, FastAPI, Linux systemsAWS/GCP GPU InfrastructureExpertise in multilingual NLP, especially Indian languages.Experience creating datasets and training models from scratch.Bonus SkillsExperience with, WebRTC or real-time streaming protocolsFrontend basics for AI demo dashboards (Streamlit/Gradio).Knowledge of TTS, voice pipelines, barge-in systems, or telephony APIs.Experience with NVIDIA NeMo or similar speech frameworksSoft SkillsStrong ownership and accountability.Excellent communication and documentation clarity.Ability to independently research, prototype, and deploy new systems.Strong prioritization and deadline management.Ability to handle high-pressure production issues.What You’ll OwnAs a Senior NLP Engineer, you will oversee:20+ production AI services (maintaining 99%+ uptime)6+ major model families (NMT, Transliteration, ASR, TTS, LLMs, RAG, OCR)State-of-the-art models outperforming industry benchmarks (20-75% better than Google ASR)Infrastructure optimization reducing GPU costs by 50%+Enterprise Conversational AI systems for banks, government, and Fortune 500 clientsMulti-model deployment at scale using quantized and CT2-optimized modelsWhy Join DevnagriWork at the intersection of NLP, ASR, GenAI, and Indic languagesFreedom to research, experiment, and train large-scale models, access to cutting edge GPUs.Opportunity to build products used by top enterprises & government clientsAmazing culture where innovation is core to the AI teamInterview PipelineScreening (Telephonic)AssignmentTechnical RoundDirector DiscussionHR ClosureIf you are looking to work in a product company and have a Go-Getter Attitude, share your updated CV With the below-mentioned detailsRelevant Experience with NLP:Current CTC:Expected CTC:Notice Period (Officially and if you're serving then LWD)Open to work from the office - Noida LocationRead the whole Job description before applying: yes/ no