Job Title:
Senior Data Scientist
Company: Bluo Software India LLP
Location: Pune, Maharashtra
Created: 2026-04-05
Job Type: Full Time
Job Description:
Job Title: Senior Data ScientistCompany: Dataism ServicesLocation: Viman Nagar, Pune, Maharashtra, IndiaExperience: 5+ YearsEmployment Type: Full-TimeAbout the CompanyDataism Services is a technology and engineering services company delivering advanced solutions in AI, machine learning, software engineering, and data systems across multiple sectors including petroleum refining, manufacturing, energy, and heavy industry. We build production-grade systems that improve operational performance, reduce cost, and accelerate innovation.Role OverviewWe are hiring a Senior Data Scientist to design, develop, and deploy AI/ML solutions for our industrial inspection intelligence platform. You will work on predictive models for equipment degradation, risk-based inspection analytics, and NLP-driven knowledge extraction from technical documents. This role demands someone who can independently drive projects from problem formulation through production deployment.Key ResponsibilitiesDesign and implement machine learning models for predictive maintenance, anomaly detection, equipment degradation forecasting, and risk scoring in industrial inspection workflows.Build NLP and document understanding pipelines to extract structured knowledge from inspection reports, U1 forms, maintenance records, and technical standards (API 580/581).Develop time-series and survival analysis models for equipment remaining life estimation, corrosion rate prediction, and inspection interval optimization.Apply statistical rigor including hypothesis testing, Bayesian inference, calibration analysis, and unbiasedness demonstration to validate model performance for engineering stakeholders.Design and execute structured experiment agendas with reproducible workflows, ablation studies, and systematic hyperparameter optimization.Build RAG pipelines and few-shot prompting systems using LLMs for domain-specific extraction and reasoning tasks.Collaborate with engineering teams to integrate AI outputs into production dashboards, APIs, and decision-support tools.Work independently with fast turnaround, delivering robust solutions with minimal supervision.Essential Skills5+ years of professional experience in Data Science or Machine Learning.Strong proficiency in Python with deep experience in pandas, NumPy, scikit-learn, and PyTorch (preferred) or TensorFlow.Solid understanding of supervised, unsupervised, and deep learning models with the ability to select the right approach for each problem.Experience with time-series modeling, survival analysis, and signal processing for predictive and forecasting tasks.Strong statistical foundations: hypothesis testing, confidence intervals, Bayesian methods, model calibration, and bias-variance analysis.Hands-on experience with NLP and document understanding: text extraction, named entity recognition, embedding models, and transformer-based architectures.Familiarity with LLM integration patterns: RAG pipelines, few-shot prompting, retrieval-augmented generation, and token-aware cost management.Experience with experiment tracking and reproducibility tools (MLflow, Weights & Biases, or similar).Proficiency with SQL and relational databases (SQL Server experience preferred) for production data pipelines.Working knowledge of cloud platforms (AWS preferred: SageMaker, Bedrock, S3) and containerization (Docker).Strong problem-solving mindset with the ability to develop robust, practical solutions under time constraints.Preferred QualificationsDomain knowledge in petroleum refining, process equipment, corrosion mechanisms, RBI methodology, or NDT workflows.Experience with multimodal model architectures (e.g., cross-attention fusion, modality-specific encoders, masked autoencoder pretraining).Experience with data governance, privacy, and regulatory considerations in industrial environments.