Dedicated and experienced Data Engineer with a strong background in researching, developing, and deploying machine learning projects and infrastructure at scale. Worked primarily on e-commerce and classified ads domains, with additional experience in finance. Experienced in search engines, recommendation engines, natural language processing, time series data, deep learning, backend engineering and distributed systems with a balanced blend of research and practical application.
Languages: Python, Scala, Go, Java
Streams: Kafka, AWS Kinesis, GCP Pub/Sub
Data Tools: Apache Spark, GCP DataFlow, AWS Athena, AWS EMR
Frameworks: Apache Beam, Flask, PySpark, Akka, Akka-HTTP, Akka-Streams
DevOps: GitLab, Jenkins, GitHub Actions, Terraform, Apache Airflow
Data: Delta Lake, Cassandra, PostgreSQL, Redis, Milvus, GCP BigTable, ElasticSearch, Amazon Athena, GCP BigQuery, ClickHouse
Observability: Redash, Looker, ELK Stack, Grafana
Cloud Platforms: AWS, GCP
Machine Learning: AWS SageMaker, MlFlow, PyTorch