Work Experience

2024-08 - Present

Data Engineer

📍 Huntsville, AL, USA

NASA IMPACT (CSDA Program)

  • Migrated Airbus and BlackSky ingestion workflows into centralized Airflow pipelines with partitioned storage and STAC item generation.
  • Built Pytest and validation workflows to enforce data and transformation correctness.
  • Automated Maxar checksum verification using Airflow, DynamoDB, and Athena, eliminating manual validation overhead.
  • Modernized and optimized legacy DAGs to reduce pipeline runtime and improve reliability.
Python, Apache Airflow, PySTAC, AWS DynamoDB, AWS Athena, STAC

2022-05 - 2024-07

Mid-Level Data Engineer

📍 Lalitpur, Nepal

GrowByData Services

  • Owned end-to-end pipelines, from proof of concept (POC) through reporting, for LeadGen and Amazon data sources.
  • Re-architected social and SEO ingestion pipelines to use multithreaded acquisition, yielding 2-5x performance gains.
  • Migrated a Redshift-heavy architecture to a lakehouse model, cutting compute costs by more than half.
  • Optimized reporting latency with materialized views and Redis cache integration.
  • Mentored junior engineers and interns on ETL design, dimensional modeling, and data platform fundamentals.
Python, Spark, AWS S3, Glue Catalog, Dremio, Redis, SQL

2021-05 - 2022-05

Associate Data Engineer

📍 Lalitpur, Nepal

GrowByData Services

  • Built the first production Pinterest Ads acquisition pipeline with OAuth2 integration.
  • Stabilized Facebook Ads ingestion under throttle constraints.
  • Resolved ETL production issues while building depth in star-schema and SCD modeling.
  • Migrated Talend flows to distributed Spark, significantly reducing ETL latency and Redshift CPU load.
Python, Scala, Spark, SQL, Redshift, OAuth2