Job Title: Artificial Intelligence Research Engineer
Company: TalentGum
Location: Jaipur, Rajasthan
Created: 2025-11-18
Job Type: Full Time
Job Description:
About TalentGum

TalentGum is transforming extracurricular learning for children aged 5–14 years through engaging live online courses in subjects such as music, dance, chess, and public speaking.

Our mission is to build the next generation of learning intelligence: an AI-driven platform that can observe, understand, and help children improve their creative and cognitive skills through personalized, real-time insights.

We are looking for AI Research Engineers who are passionate about bridging human creativity and machine intelligence, and who want to work at the intersection of AI, performance, and education.

Role Overview

As an AI Research Engineer, you’ll design and develop machine learning systems that can understand and evaluate human performance, starting with sound (music, speech) and later expanding to vision (movement, gestures). You’ll collaborate with Audio/Vision Engineers, Backend Developers, and Subject Matter Experts (SMEs) such as musicians, dancers, and coaches to convert expert intuition into measurable, scalable AI feedback.

Key Responsibilities

• Research, design, and train AI models for real-time performance evaluation across domains (music first, then dance, speech, and chess).
• Implement and optimize deep learning architectures for audio and/or visual understanding (CNNs, RNNs, Transformers).
• Work closely with Audio/Vision Engineers to build data pipelines for clean, real-time feature extraction (spectrograms, keypoints, pose sequences).
• Collaborate with SMEs to define “performance quality” metrics and label datasets.
• Develop evaluation frameworks to quantify model accuracy against expert feedback.
• Experiment with cross-modal fusion (audio + vision) for synchronized analysis in future domains like dance.
• Optimize models for low-latency inference on web/mobile devices (ONNX, TensorRT, TF Lite).
• Document research findings and prototype outcomes, and contribute to internal knowledge-sharing.

Required Skills & Experience

• 3+ years of hands-on experience in Machine Learning / Deep Learning (PyTorch, TensorFlow).
• Strong mathematical foundation in signal processing, time-series analysis, and statistics.
• Proven experience with audio or visual data: music, speech, motion, or similar perceptual domains.
• Familiarity with MIR (Music Information Retrieval) or Computer Vision tasks like:
  o Pitch detection, beat tracking, timbre classification, speech analysis,
  o Pose estimation, gesture recognition, or motion tracking.
• Experience with model optimization and deployment (TorchScript, ONNX, TensorRT).
• Strong Python skills and familiarity with libraries such as NumPy, pandas, Librosa, Essentia, OpenCV, or MediaPipe.

Nice to Have

• Research or published work in audio AI, multimodal AI, or performance evaluation.
• Experience building or experimenting with real-time ML inference systems.
• Background in music, performing arts, or educational AI.
• Familiarity with cloud platforms (AWS, GCP) and CI/CD for ML (MLflow, DVC).
• Curiosity and creativity in experimenting with human-centered AI.

What You’ll Achieve

• Shape the foundation of TalentGum’s AI Learning Intelligence Platform.
• Transform expert musical and artistic insights into measurable, adaptive AI systems.
• Build models that listen, see, and guide young learners around the world.
• Contribute to a system that evolves across music, dance, public speaking, and chess.

Why Join Us

• Work at the intersection of AI, art, and education.
• Collaborate with passionate technologists and subject matter experts.
• Creative freedom to explore cutting-edge models in audio and vision AI.
• Build something with direct impact: helping children discover their best selves.
• Competitive compensation, equity options, and global career visibility.