Sourav Roy

Sourav Roy

@SouravRoy-ETL

I architect pipelines you forget exist.

Whoa! Somewhere in this Planet!
32
Followers
7
Following
30
Public Repos
0
Private Repos

Language Breakdown

Lines of code distribution across 17 owned repositories

8.0M Total LOC
C++
2,587,434 lines
32.2%
N/A
Rust
1,876,270 lines
23.4%
N/A
TypeScript
1,142,697 lines
14.2%
N/A
Java
1,038,007 lines
12.9%
N/A
Python
744,262 lines
9.3%
N/A
Other
634,385 lines
7.9%
N/A

Generalist Developer

G-shaped

Versatile across many languages and paradigms

C++
Rust
TypeScript
Java
Python

Collaboration Network

Global Impact visualization

LIVE
Sourav Roy
0 active collaborators

Repos

30

PRs

0

Growth

+18%

Top Collaborators

No collaborator data yet.

Coding Streak

Contribution activity over the past year

13 days
5,471
Contributions
1,070
Commits
17
Pull Requests
Jun Jul Aug Sep Oct Nov Dec Jan Feb Mar Apr May Jun
Mo
We
Fr
Based on GitHub activity
Less
More

Top Repositories

slothdb

An experimental embedded SQL engine in C++20. Query Parquet, CSV, JSON, Arrow, Avro, SQLite, and Excel files directly with SQL, in-process. Early-stage.

480 5
C++
duckle

Local-first ETL/ELT studio: a drag-and-drop visual pipeline designer that compiles to SQL and runs on DuckDB. Tiny desktop app, no servers, git-friendly workspaces.

443 28
Rust
GCP-data-modelling-from-YML

Creation of Data Modelling Sheet in an automated fashion from GCP Composer Airflow Variable setters. This activity has reduced dependency of Data Modelling Team to prepare Mapping Sheet from 100% to 0%.

2 0
Python
Talend-quality-control-framework

Talend Quality Control Framework is an automated code analysis tool to motivate Talend developers and ETL Testers to design Talend Projects that adheres to a Strong and Solid coding standard.

2 0
Java
GCP-BigQuery-auto-ddl-generator

Creating automated Google Cloud Big Query DDL Statements for Table Creation. This activity has reduced the manual efforts by 100% to create BQ DDL Insert/Append Queries. This is a Dev/Test only activity/for personal use only.

2 0
Python
automated-view-ddl-creation

Creating automated Views for use in Aggregation Layer of any reporting tool(Power BI, Qlik, QuickSight) and to reduce manual effort of writing the views to nil. This activity has reduced resource focusing on writing SQL Queries by 100%. The Views are created from a CSV file as an input and the Output is the SQL View File and can be run in any Google BigQuery Instance.

2 1
Python
GCP-airflow-variable-setter

Creating automated YML to set Variables in Apache Airflow (Composer) to reduce manual effort to 0. This activity has reduced resource focusing on Manual Tasks by 99%. Fault tolerance is 1% in case of wrong input passed by triggers.

2 0
Python
awesome-db

A curated list of amazingly awesome database libraries, resources and shiny things by https://www.numetriclabz.com/

1 0
taldbt

Convert legacy Talend ETL to modern dbt SQL using semantic AI transpilation.

1 1
Python
Talend-to-DBT-DuckDB-TempralIO-Agent

The Talend-to-dbt Migration Agent is not just a code translator; it is a logic refactoring engine. It parses the deep semantic structure of Talend jobs to ensure 1:1 behavioral parity while modernizing the codebase.

1 1
Python