Hello, I'm Abhilash B S

Skills

Apache NiFi

Moderate

Apache Spark

Advanced

Apache Kafka

Advanced

Apache Oozie

Moderate

Debezium and Confluent Kafka connectors

Advanced

Docker

Advanced

Metabase

Moderate

Azkaban

Moderate

Apache ZooKeeper

Basic

Scylla

Advanced

Cassandra

Advanced

Postgres

Advanced

Redshift

Advanced

BigQuery

Advanced

Redis

Moderate

JanusGraph

Advanced

MySQL

Basic

C/C++

Moderate

Java

Moderate

Go

Moderate

Rust

Basic

Scala

Moderate

Python

Advanced

Resume

Education

August 2021 - May 2023

Master of Science in Computer Science (CGPA: 3.85/4)

  • Courses: Analysis of Algorithms, Database Systems, Operating Systems, Innovation in Defense Systems, Advanced Mobile Devices and Game Consoles, Foundations of Data Management, Information Retrieval

University of Southern California, Los Angeles, USA

June 2014 - July 2018

Bachelor of Engineering (CGPA: 9.33/10)

  • Courses: C Programming, Data Structures, Programming with C++, Web Programming, Computer Organization and Architecture, OOPS with Java, Database Management Systems, Operating Systems, Computer Networks, Analysis and Design of Algorithms, Internet of Things, Software Engineering, Artificial Intelligence, Neural Networks, Big Data and Analytics

B.M.S. College of Engineering, Bengaluru, India

June 2014 - July 2018

12th (CBSE: 92%)

Teen mostly living in the world of math and physics.

Kendriya Vidyalaya CRPF, Bengaluru, India

June 2014 - July 2018

10th (CBSE Board CGPA: 10.00)

Kid enjoying school life.

Kendriya Vidyalaya CRPF, Bengaluru, India

Experience

Sept 2022 - Feb 2023

Site Reliability Engineer

  • Managed operations of YARN and Kubernetes clusters totaling ~11,000 nodes and ~15 PB of memory across multiple data centers
  • Deployed and owned Spark, YARN, and various other batch components on a new data center
  • Enabled Spark parameter auto-optimization for jobs (shuffle partition splitting, long-tail skewed tasks, OOM self-adaptation, and YARN shuffle server application priority) through a rule-based engine, improving job resource utilization by 20-30%
  • Integrated a control layer for job submission to level-2 schedulers such as YARN and Kubernetes, providing a unified resource pool, a global virtual queue for compute and storage quota management, and global scheduling based on data locality, DC room load, and data access latency

TikTok U.S. Data Security

Sept 2022 - Feb 2023

Data Engineer

  • Configured Nix shell environments for multiple Python projects and built reproducible container images using Poetry and Nix packages for GitHub Actions workflows
  • Enhanced Airflow DAGs with custom context managers to ensure atomic BigQuery updates at a data rate of 1B records/hr

Chartboost

Sept 2022 - Feb 2023

Senior Data Engineer

  • Conducted data analysis on event datasets to understand the data and derive metrics for debugging Flink applications
  • Evaluated and documented the feasibility of running Apache Spark jobs on AWS EMR Serverless to process TV events

Samsung Research America (WorldLink)

Nov 2019 - July 2021

Data Engineer

  • Built and maintained data flows involving terabyte-scale data and graph operations, such as connected components, over a billion-node graph.
  • Contributed to and optimized a ScyllaDB backup and recovery tool.
  • Built utility products to help maintain several tens of big data jobs, track failures, and generate summary reports.

Zeotap India Pvt Ltd, Bengaluru, India

June 2018 - Nov 2019

Data Engineer

  • Maintained the data warehouse and built ETL pipelines for batch and real-time data replication from OLTP databases to an OLAP database.
  • Built and maintained data pipelines for application-specific operations, such as updating caches via change data capture.
  • Built and maintained reports that provide insights into day-to-day business operations.

Endurance International Group (also known as Directi), Bengaluru, India

Jan 2018 - Apr 2018

Software Engineering Intern

  • Integrated various third-party applications and add-ons for the hosting product into the new WebPro panel platform for resellers.

Endurance International Group (also known as Directi), Bengaluru, India

Jun 2017 - Jul 2017

Data Science Intern

  • Designed and built a CNN variant that extracts visual features from product visuals.
  • Built a data science model on the extracted visual features and sales data for sales forecasting based on product sales decisions.

Affine Analytics, Bengaluru, India

About Me

I am a passionate techie who loves problem solving. My area of specialization is architecting, building, and maintaining data systems.

Recognition

Harley Keital Award

This was awarded for reducing the operational overhead of various data flow jobs at EIG.

Winner @ Geek of Thrones

Best project in this intra-company competition at EIG. Built plugins for Atlassian products to integrate them with Google Chat.

Winner @ Value Addition category at InnoJam

Best project in the value addition category at InnoJam, the yearly intra-company competition at EIG. The project covered customer clustering based on buying patterns for better suggestions, and threshold-based alerts on product usage for upgrade suggestions.

Medium Blog

Diving deeper into tech (yet to come)

Blog posts aimed at building a clearer understanding of various technologies.

Get In Touch

My Contact Details

  • Email abhilashbss@gmail.com
  • Phone +1 213-573-9659, +91 953-876-5521