🚀 ARIJIT SAMAL

I'm a Data Scientist

ML | AI | NLP | DL | Gen-AI | LLM | Big Data
Hire Me
Arijit Samal - Data Scientist & AI Engineer

About Me

My Introduction

Results-driven data scientist with expertise in big data, machine learning, deep learning, and Gen-AI. Proficient in Python, big data technologies, PyTorch, and LLMs, with a strong background in developing data-driven solutions across various domains.

My expertise spans the healthcare, e-commerce, social media, and IoT domains. I have a proven ability to design and implement data analysis, NLP, knowledge graph, deep learning, and computer vision solutions.

I have demonstrated success in hackathons, earned prestigious scholarships, conducted impactful research, and contributed to open-source projects, showing strong problem-solving, collaborative, and analytical skills and a commitment to innovation and impact through data science.

Skills & Technologies

C/C++
Python
Java
OOP
DSA
NumPy
Pandas
Seaborn
Matplotlib
Scikit-learn
PyTorch
TensorFlow
ML
DL
Data Analytics
Data Visualization
OpenCV
LangChain
Agents
LangGraph
Streamlit
LLM
SQL
PostgreSQL
Airflow
PySpark
Docker
Shell
GCS
MinIO
Big Data
SPARQL
Cypher
Neo4j
GraphDB
OrientDB

Education

Master's Program

Erasmus Mundus Masters in Big Data Management and Analytics (BDMA)

2023 – Present
Semester 1

Université libre de Bruxelles (ULB)

Brussels, Belgium
MS in Computer Science
Semester 2

Universitat Politècnica de Catalunya (UPC)

Barcelona, Spain
MS in BDMA
Semester 3

CentraleSupélec (CS), Université Paris-Saclay

Paris, France
M2 in BDMA
Bachelor's Degree

Indian Institute of Science Education and Research (IISER), Bhopal

2019 – 2023
Bhopal, India
Bachelor's in Electrical Engineering and Computer Science (EECS)
Minor in Data Science and Engineering (DSE)
G.P.A.: 9.57 / 10.00

Professional Experience

Hedge Fund

Capital Fund Management (CFM)

Data Scientist

March 2024 – Present
Paris, Île-de-France, France

Building a multi-agent system to resolve alerts in CFM's data equity referential pipeline, which is an essential part of its intraday trading workflows.

Technologies

Python, OracleDB, LangChain, LangGraph, Google ADK, FastAPI, Streamlit

Key Achievements

Built a multi-agent system to automatically resolve data pipeline alerts for the Data Referential Equity team
Engineered an ensemble RAG pipeline (Graph RAG, RAPTOR, hybrid search, reranking, Agentic RAG)
Created autonomous coding agents using ReWOO, LLMCompiler, and LATS planners
Implemented agentic swarm orchestration with dynamic handoffs between agents
Achieved 97% RAG accuracy and 94% agent performance on production CFM alerts
Reduced mean time to resolution from 60 min to 2 min (fast) / 5 min (normal), cutting handling time by 92%
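As a quick check on the figures above, the 92% reduction appears to correspond to the normal mode (60 min to 5 min); the fast mode works out higher. A minimal sketch of the arithmetic, where the function name is illustrative and the mapping of 92% to the normal mode is an assumption:

```python
def percent_reduction(before_min, after_min):
    """Percentage drop in resolution time relative to the original time."""
    return 100.0 * (before_min - after_min) / before_min

# Figures from the achievement above: 60 min before, 5 min (normal) or 2 min (fast) after
print(round(percent_reduction(60, 5)))  # 92
print(round(percent_reduction(60, 2)))  # 97
```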
Open Source

MobilityDB

Open Source Developer

July – September 2024
Brussels, Belgium

Improved JMEOS, the Java binding for the MEOS spatiotemporal library, as a contribution to the open-source geospatial database ecosystem.

Technologies

C, Java, FFI, CI/CD, GitHub Actions, Python

Key Achievements

Contributed 30K+ lines of code to JMEOS and MobilityDB repositories
Boosted test coverage by 70% using JUnit for MEOS data types
Automated documentation deployment using GitHub Pages, streamlining API visibility for 500+ users
Built CI/CD pipelines with GitHub Actions, cutting build and integration times by 30%
Research

Health Technologies Lab (HTL), IBME, University of New Brunswick (UNB)

Deep Learning Researcher

May – October 2023
Fredericton, Canada

Worked on translating foot pressure maps to 3D human poses, developing biometric identification systems with deep learning models.

Technologies

PyTorch, Python, MediaPipe, TensorFlow, Keras, OpenCV, MATLAB

Key Achievements

Captured foot pressure maps using 100 Hz tiles; mapped to 3D poses with 33 keypoints
Used video from 8 cameras as supervision; developed Encoder-Decoder, CRNN, and CNN+LSTM models
Evaluated models using MPJPE and MSE, enabling non-invasive person identification with 95% accuracy
Pioneered a novel approach to biometric authentication using gait analysis
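
For reference, MPJPE (mean per-joint position error) is the average Euclidean distance between predicted and ground-truth keypoints. A minimal sketch in plain Python, assuming each pose is a list of (x, y, z) tuples; the function name and sample data are illustrative and not taken from the project code:

```python
import math

def mpjpe(predicted, target):
    """Mean per-joint position error: average Euclidean distance
    between matching 3D keypoints of two poses."""
    if len(predicted) != len(target) or not predicted:
        raise ValueError("poses must be non-empty and have the same number of keypoints")
    return sum(math.dist(p, t) for p, t in zip(predicted, target)) / len(predicted)

# Hypothetical pose with 33 keypoints; the prediction is shifted by 3 on x and 4 on y,
# so every joint is off by exactly 5 units.
target = [(float(i), 0.0, 0.0) for i in range(33)]
predicted = [(x + 3.0, y + 4.0, z) for x, y, z in target]
print(mpjpe(predicted, target))  # 5.0
```

The result is in the same unit as the keypoint coordinates, so lower values mean the predicted pose is closer to the camera-derived ground truth.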

Research Publications

Published

Research Paper

December 2022 – August 2023
Elsevier ScienceDirect Smart Health Journal
Arijit Samal, Haroon R. Lone

Thermal Vision: Pioneering Non-Invasive Temperature Tracking in Congested Spaces

Co-authored this paper as part of my Bachelor's thesis, developing real-time temperature tracking for crowded environments using edge devices.

Technologies Used

Python, OpenCV, TensorFlow, Keras, PyTorch, Scikit-learn, YOLO, IoT

Featured Projects

Personal Project
Completed

OASIS OS

Agents, GUI Automation, LLM, Gen-AI

2024

An intelligent workflow automation platform that brings AI-powered automation to your workspace. Teach OASIS OS a workflow once, and it handles the repetitive task from then on. It features teach-mode recording, cross-platform GUI automation, and a modern web interface.

Technologies

Python, Next.js, FastAPI, GPT-4.1, Groq, Ollama, Whisper, TypeScript, LangGraph, LangChain

Key Achievements

Teach Mode: Record workflows with mouse/keyboard tracking and voice commands
Cross-platform GUI automation (Windows, macOS, Linux) with visual debugging
Extensible architecture with FastAPI backend and multiple AI model support
Smart automation with workflow execution via simple commands
Voice command integration with speech-to-text

Get In Touch

Contact Information

Available for opportunities

Open to full-time roles and exciting opportunities

Send me a message

Ready to work together?

I'm always interested in hearing about new opportunities, innovative projects, and ways to contribute to cutting-edge research in AI and ML.

Let's Talk