Jerome Rajan

0 %
Jerome Rajan
Staff Solutions Consultant at Google
Data & Analytics
  • Residence:
    India
  • City:
    Mumbai
SQL
Dataproc, EMR
Hadoop
BigQuery
AWS Glue
PySpark, Python
Data Pipeline Design
Tableau, Redshift, Snowflake
IBM DataStage
  • AWS Lambda, S3, EMR, SQS, DynamoDB, Step Functions, Cloud Functions
  • Unix Shell Scripting, Python
  • Oracle, DB2, Redis
  • Alteryx, VBA, Blueprism, UiPath
English
Tamil
Hindi
Malayalam
Marathi

Tag: Spark

December 15, 2023 / Technology
Spark Config Calculator

The Spark Configuration Tool is a Streamlit-based application designed to assist users in optimizing Apache Spark configurations. It allows users…

Spark Architecture – Notes & FAQ
July 29, 2021 / Technology
Spark Architecture – Notes & FAQ

I’ve spent the past couple of weeks trying to master the Spark Architecture and this post is a running summary of all my notes and questions gathered from across the internet & Stackoverflow. If you feel something is incorrect, I’ll be happy to discuss. Hope you find it useful!

The All New AWS Glue Studio
October 7, 2020 / Technology
The All New AWS Glue Studio

Up until now, AWS provided a visual representation of your code but never really allowed you to build using a…

Real Time Data Streaming Into Kinesis & Ingestion Into Postgres Using AWS Glue – Part 2 (Configure Glue Catalog Tables)
October 6, 2020 / Technology
Real Time Data Streaming Into Kinesis & Ingestion Into Postgres Using AWS Glue – Part 2 (Configure Glue Catalog Tables)

Before we start building Glue jobs, we need to understand that one of the unique features of Glue is its…

Real Time Data Streaming Into Kinesis & Ingestion Into Postgres Using AWS Glue – Part 1 (Setup)
October 5, 2020 / Technology
Real Time Data Streaming Into Kinesis & Ingestion Into Postgres Using AWS Glue – Part 1 (Setup)

This April, Amazon announced support for serverless streaming ETL using AWS Glue. For the uninformed – AWS Glue is built…

AWS – Develop ETL jobs using AWS Glue Endpoints
October 3, 2020 / Technology
AWS – Develop ETL jobs using AWS Glue Endpoints

AWS Glue scripts can have start up times that could be as long as 12 minutes especially if you are…

Be Original
Would the boy you were be proud of the man you are?