top of page
Wavy Abstract Background

SUJAN SHIROL

Data Scientist at Rolls-Royce

Research Mentor

IMG_20240714_164817.jpg

With a strong background in Data Science and Machine Learning, my work currently focuses on cutting-edge Generative AI (GenAI) development. My role at Rolls-Royce involves fine-tuning Large Language Models (LLMs), integrating Retrieval-Augmented Generation (RAG), and leveraging Vector Databases. I design and develop ML models to address real-world business challenges, conduct critical hypothesis testing, and deliver data insights to stakeholders.

As a Research Mentor at Shunya, I volunteer to guide PES University engineering students on Data Science and Machine Learning projects fostering the next generation of data science talent

  • a1d14ba7881740c8c3e428a75132a2ae
  • LinkedIn
  • images (1)
  • images
  • iconfinder_189_Kaggle_4519136
  • Tableau-Logo-for-website
Home: About Me
Home: Experience

WORK EXPERIENCE

rr.jpg

DATA SCIENCE,
ROLLS-ROYCE

August 2022 - Present

➤ GenAI development - fine-tuning Large Language Models (LLMs), developing Retrieval-Augmented Generation (RAG), Vector Database integration and LLM deployment.
➤ Designing and developing NLP models to solve real-world business problems.
➤ Constant research and implementation on cutting-edge Generative AI (GenAI) techniques.
➤ Conducting critical hypothesis testing and experiments
➤ Delivering data insights and conclusions directly to stakeholders

Models: Mistral-7B, LlamaIndex, Embedding Models, Transformers, Tensorflow and PyTorch frameworks

Skills: Data Science, Machine Learning, NLP, Generative AI, Hypothesis Testing, Research

20211214 123810_shunya.png

Research Mentor,
Shunya - PES University

Sept 2023 - Present

Volunteer mentorship for Data Science and Machine Learning projects of PES University engineering students

Skills: Teaching, Mentoring, Communication

fleek.webp

DATA SCIENCE INTERN, FLEEK

May 2022 - July 2022

End-to-end experimenting and developement of SMS and E-mail Parser: Requirement gathering, codebase of GitHub, DVC, NumPy style documentation, Sphinx documentation.

In-depth research and documentation on MLOps tools & techniques to monitor models in production.

Models: SentencePiece, Casual Language Modeling, Named Entity Recognition, Cosine Similarity Clustering

Skills: Python, PyTorch, Machine Learning, NLP, MLOps, DVC, Sphinx, Git

msys.webp

DATA SCIENCE INTERN,
MSYS TECHNOLOGIES

November 2021 - April 2022

Single-handedly built a Recommendation System for Vending Machines on 2+ million live sale data.

This project aims to help the vending machine operator to gain optimal sales by addressing three central questions: what product to stock when to stock up, and what location to stock.

Combining time-series forecasting, machine learning, and data mining techniques.

I created an ensemble forecasting algorithm for time series.

 

Models: Exponential Smoothing, ARIMA, Apriori, Association Rule

Skills: MLOps, DVC, Sphinx, NLP, Git, Machine Learning, Python

image_2021-12-15_130842.png

Technical Writer 

June 2021 - June 2022

➤ Produce high-quality, highly technical articles and other technical work in the field of machine learning & artificial intelligence.

➤ Interned for         Towards AI, a well established digital article publication company, wherein my work involved writing, coding, and developing state-of-the-art Data Science and Machine Learning tutorials like :

 

Skills: Data Science, Editing, Technical Writing, Machine Learning

towards-ai-square-circle-png.png
web-logo.jpg

PROJECT ENGINEER, WIPRO LIMITED

June 2019 - October 2020

➤ Intelligent Factory Assessment and Recommendation Tool. The tool takes an analytical approach and provides the Intelligent Factory Index, which in turn comprises Connected, Contextual, Optimized, and Self Aware Indices.

➤ Coordinated with hardware and system engineering leads to gather and develop system requirements.

➤ Designed, implemented, and monitored web pages and sites for continuous improvement.

Skills: Python scripting, Statistics, analytics, Flask framework.

EDUCATION

Home: Education

July 2020 - August 2022

MASTERS OF TECHNOLOGY IN DATA SCIENCE AND MACHINE LEARNING,
PES UNIVERSITY

Graduated

Achievements: Thesis work in partnership with MSys Technologies on an experimental project.

SGPA: 9.36

August 2015 - July 2019

BACHELOR OF ENGINEERING IN COMPUTER SCIENCE,

RNS INSTITUTE OF TECHNOLOGY

Visvesvaraya Technological University
CGPA: 7.1
No backlog history.

January 2013 - March 2015

SENIOR SECONDARY EDUCATION IN SCIENCE,

VISION PU COLLEGE

Karnataka State Board
Physics, Chemistry, Mathematics, and Biology
Score: 74%

June 2011 - March 2012

Central Board of Secondary Education
CGPA: 8.6

pngfind.com-python-logo-png-2301392.png

Python

Two years of industry experience. I'm friends with Python for 5+ years.

pngegg (2).png

Data Analytics

Visualizing data with Tableau. Business insights with Pandas, Seaborn, Plotly.

pngegg (7).png

Natural Language Processing

Proficiency in TensorFlow and PyTorch, Gensem, NLTK. Projects on Text analysis, text classification, text generation

Screenshot_2024-08-30_185740-removebg-preview.png

Generative AI

Generative AI, GPT, RAG, Fine-tuning LLM, Deployment using vLLM

pngegg (6).png

Machine Learning

Worked on various projects involving different ML algorithms including regression, classification, recommendation. 

pngegg (5).png

Deep Learning

Proficiency in TensorFlow and PyTorch. Projects on CNN, GAN, Object Detection/Localization

pngegg (1).png

Statistics

Built projects to make statistical business decisions. Published simple tutorial to make Hypothesis testing easier.  

pngegg (3).png

SQL

Hands-on MySQL Workbench. Projects demonstrating complex queries and window functions.

pngegg (8).png

Web Development

Industry experience in working with Flask, HTML, CSS, Jquery, JSON, and AJAX.

Home: Skills

WHAT I CAN OFFER YOU

Bar Chart

PROJECTS

Home: Projects

FEATURED BLOGS

Home: Blogs
1_Le0zcOjAWy6vXOuBBBC2mQ.jpeg

This tutorial will be diving into genetic algorithms in detail and explaining their implementation in Python. We will also explore the different methods involved in each step diagrammatically. As always, we are including code for reproducibility purposes. We have split the code when required while exploring the different steps involved during our implementation.

Home: Resume
bottom of page