Job Title:
Data Engineer
Company: Zyoin Group
Location: Mumbai, Maharashtra
Created: 2025-09-04
Job Type: Full Time
Job Description:
Job Title: Data Engineer (CDC / Realtime Data Integration) Location: Navi Mumbai, India Experience Level: 4–6 Years About the Role: We are seeking a skilled and experienced Data Engineer with expertise in Change Data Capture (CDC) and real-time data integration . The ideal candidate will have hands-on experience in designing, implementing, and managing real-time data pipelines using Debezium, Kafka , and related technologies. This role requires building low-latency pipelines that enable instant data availability for analytics, operational reporting, and downstream systems, ensuring data integrity and scalability in a fast-paced environment. Key Responsibilities: Design, develop, and maintain real-time data ingestion pipelines using CDC from relational and NoSQL databases. Implement and manage data streaming solutions using Apache Kafka, Kafka Connect, and Debezium. Configure and monitor Debezium connectors for accurate data capture. Develop real-time data transformation logic using Spark Streaming, Flink, or Kafka Streams. Ensure data quality, consistency, and governance across real-time pipelines. Collaborate with application teams, analysts, and data scientists to meet real-time data needs. Troubleshoot and resolve pipeline, Kafka cluster, and connector issues. Implement monitoring, logging, and alerting to ensure availability and performance. Document real-time architectures, flows, and procedures. Adhere to data governance and compliance standards. Technical Skills: Real-time Data Integration: Strong experience with CDC mechanisms. Streaming Tech: Apache Kafka, Kafka Connect, Kafka Streams. CDC Tools: Debezium (hands-on expertise). Programming: Python or Java (Spring Boot, REST APIs), Shell scripting. Big Data: Apache Spark Streaming, Apache Flink (preferred). Cloud: GCP (BigQuery, Data Lakes) or equivalent platforms. Databases: Relational + NoSQL for CDC integration. SQL: Advanced querying & validation skills. DevOps: Docker, Kubernetes, CI/CD deployment. Monitoring: Prometheus, Grafana for real-time data health. Collaboration & Soft Skills: Strong collaboration with cross-functional engineering and business teams. Ability to clearly document and present real-time data architectures . Skilled in stakeholder communication (technical & non-technical). Proactive problem solver with attention to data integrity & reliability . Adaptable, self-driven, and detail-oriented. Qualifications: BE/B.Tech and/or M.Tech in a relevant discipline. 4–6 years of Data Engineering experience with a focus on CDC & real-time data integration . Exposure to BFSI/NBFC domain is a strong advantage.