Data Engineering

 Services

Data Infrastructure

  • Data lake design and implementation

  • Deployment across multiple models: cloud, on-premises, and hybrid

  • Real-time and batch data processing for AI/ML use cases and dashboards

  • Design optimization of database and data warehouse platforms

Data Processing

  • Real-time and batch data pipelines and processing

  • Data quality and standardization services

  • Data integration and maintenance services

  • Data lake and data warehouse setup

Data Analytics

  • Consulting and planning for analytical method development

  • Search and recommendation systems

  • Quality evaluation of organizations’ analytical products

  • Trend and pattern analysis

Technology Toolset

Cloud Solutions

  • Redshift

  • Athena

  • SQS/SNS

  • S3

  • DynamoDB

  • Aurora

  • EMR

  • Glue

 

Open-Source Tools/Frameworks

  • Hadoop, Hive

  • HBase

  • Spark

  • Airflow

  • Oozie

  • Storm

  • Kafka

  • Sqoop

  • PostgreSQL/MySQL

  • Elasticsearch

 

Programming

  • Python: numpy, pandas, matplotlib, scikit-learn, scipy, spark, pyspark

  • Java

  • SQL, T-SQL, H-SQL, PL/SQL