Company: Baanyan Software Services, Inc.
Location: Hybrid
Posted on: May 27
Job Title: AWS Data Engineer
Location: Richmond, VA (Hybrid/Onsite preferred) and Edison, New Jersey
Duration: Long-Term Contract
Employment Type: W2 Only (No C2C Resumes)
Visa Eligibility: OPT, H4 EAD, L2S
Education: Master's Degree / Bachelor's Degree
H1B Sponsorship: Yes
Experience: Strong Minimum 4 years plus in India and at-least 1 year in the USA
Job Description:
We are seeking a highly skilled AWS Data Engineer to join our team on a long-term contract in Richmond, Virginia. The ideal candidate will have hands-on experience building and optimizing scalable data pipelines and data lake architectures on AWS, with strong expertise in Python, PySpark, SQL, AWS Glue, and EMR. Experience with Databricks and Tableau for visualization is a strong plus.
Key Responsibilities:
- Design, develop, and maintain ETL pipelines using AWS Glue, PySpark, and AWS EMR
- Ingest, transform, and validate large volumes of structured and semi-structured data from diverse sources
- Develop scalable data lake solutions using AWS S3, Glue Catalog, and Athena
- Collaborate with Data Analysts and Data Scientists to support business intelligence and advanced analytics initiatives
- Optimize data workflows for performance and cost efficiency
- Implement and monitor data quality checks and ensure pipeline reliability
- Work with Databricks to develop notebooks, manage jobs, and orchestrate data pipelines
- Create dashboards and visualizations using Tableau for business reporting and insights
- Use SQL and Python to query, manipulate, and analyze large datasets
- Follow best practices for security, compliance, and governance of cloud-based data platforms
- Participate in Agile ceremonies and collaborate with cross-functional teams
Required Skills:
- 5+ years of experience in data engineering and cloud-based ETL development
- Strong hands-on experience with AWS Glue, AWS EMR, S3, Athena, and CloudWatch
- Proficient in Python, PySpark, and advanced SQL
- Experience with Databricks for large-scale data processing and job orchestration
- Strong understanding of data lake architecture and ETL best practices
- Hands-on experience with Tableau for data visualization and reporting
- Experience with version control (e.g., Git), CI/CD tools, and Agile methodologies
- Excellent problem-solving and communication skills
Nice to Have:
- Experience with AWS Lambda, Step Functions, or other serverless tools
- Familiarity with Redshift, Snowflake, or other data warehouses
- Knowledge of data security, encryption, and IAM policies in AWS
Thanks & Regards
Vasu
Baanyan Software Services Inc
100 Metroplex Drive, Suite 100, 1st Floor, Edison, NJ. 08817
Phone: 732-595-9006 Extn: 207 | Direct: 732-660-9077
Email: [email protected] | www.baanyan.com
An E-Verified Company