Position Overview
We are seeking a motivated Data Engineer to join our team in Bengaluru. This role is ideal for fresh graduates or early-career professionals eager to build expertise in data engineering, ETL processes, APIs, and large-scale data systems.
Key Responsibilities
- Assist in creating and maintaining data connectors and ingestion pipelines for multiple data sources including APIs, databases, files, and SaaS platforms.
- Support ETL/ELT processes to load data into the platform.
- Understand various source system schemas, APIs, and data formats such as JSON, CSV, and relational tables.
- Implement basic data transformations, validations, and quality checks.
- Write clean, maintainable, and testable code under the guidance of senior engineers.
- Conduct unit and basic integration testing of data pipelines and connectors.
- Assist in troubleshooting data issues, pipeline failures, and synchronization problems.
- Document connectors, pipelines, and operational procedures.
- Utilize Git and CI/CD pipelines for version control and deployments.
- Adhere to engineering best practices, coding standards, and data governance policies.
- Collaborate closely with senior data engineers, QA teams, and product stakeholders.
Required Qualifications
- Bachelor’s degree in Computer Science, Information Technology, Engineering, or a related discipline.
- Basic to intermediate proficiency in Python and SQL.
- Understanding of core data engineering concepts such as ETL, data pipelines, and data warehousing.
- Familiarity with REST APIs and working with JSON/CSV data.
- Basic knowledge of databases like MySQL or PostgreSQL.
- Awareness of Git and version control workflows.
- Strong problem-solving skills, attention to detail, and eagerness to learn.
- Effective communication skills.
Preferred Skills (Nice to Have)
- Exposure to PySpark, Spark, Databricks, or AWS EMR.
- Basic knowledge of cloud platforms such as AWS, Azure, or Google Cloud.
- Understanding of CI/CD methodologies.
- Experience through internships or academic projects in data engineering, backend development, or ETL.
- Awareness of data modeling and data warehousing principles.