Shubham Pratap Singh

Magdeburg, Germany | +49-17686049487 | shubhamp.singh@yahoo.com

Data Scientist with 3+ years of broad experience in building data-intensive applications. Experience in predictive modeling, data processing, and ML algorithms, as well as scripting languages like Python. Passionate about building models that fix problems. Actively looking for internships and full-time roles in the field of Machine Learning and Data Science.


Skills

Programming Languages & Tools
Extra skills / knowledge
  • 4+ years of experience in writing Python code to pre-process and analyze data to generate actionable insights.
  • Experience in implementing and operating end-to-end machine learning and data processing pipelines.
  • Sound knowledge of Kubernetes, Kafka, Pyspark, Git, Docker, basic cloud skills (AWS), and experienced in agile projects.
  • Good understanding of basic Machine Learning Techniques (SVM, Logistic Regression etc) and deep learning algorithms (CNN, LSTM, Transformers)
  • Experience in data science tools (e.g. pandas, sklearn, pytorch, keras)
  • Image processing and computer vision (OpenCV, Mediapipe)
  • Text processing and analysis (NLTK, Spacy, Transformers)
  • Hand on experience with using MLOps tools to build Machine Learning production pipelines and model deployment.

Areas of interests
  • Machine Learining / Data Science / Deep Learning
  • Computer Vision
  • Natural Language Processing (NLP)
  • Time series analysis
  • Medical Imaging
  • Generative models - VAE, GAN’s

Experience

Data scientist (werkstudent)

Pixsy (Berlin, Germany)

Working with image similarity and fine-tuning of in-place Deep learning models. Recenlty built an interface for template matching.

June 2021 - Present

ML Engineer

Accenture AI (Bangalore, India)

Developed ML based failure prediction models to assess failures of trade transactions. Also formalised and built a prediction tool for estimating the runtime of crucial batches of transactions.

November 2018 - August 2019

Assistant Application Analyst

Accenture (Pune, India)

Performed exploratory data analysis (cleaning, analysis, visualization) on the sales data of the retail client. Built an email classification and automatic support ticket assignment model. Created optimized and efficient SQL queries.

July 2016 - November 2018

For a detailed reference about this section, please visit this link.


Education

Otto-Von-Guericke Universität (Magdeburg, Germany)

Masters of Science (M.Sc.), Data and Knowledge Engineering (Data Science)
Machine Learning | Introduction to Deep Learning | Learning Generative Models | Computer Vision and Deep Learning | Recommenders | Swarm Intelligence | Visual Analytics | Data-Warehouse-Technologies

October 2019 - Present

Guru Gobind Singh Indraprastha University (Delhi, India)

Bachelors of Technology (B.Tech), Electronics and Communication Engineering
Data Structures | Introduction to Programming (C++) | Software Engineering | Applied Mathematics

August 2012 - May 2016

Community and social activity

Community Builder ('Data Crunch')

MSIT
Spearheaded a coummunity of like-minded people having interst for the domain of Data Science
September 2014 - May 2015

Mentor for international students

OVGU, Informatics Dept.
Helped new incoming students with the understanding of the German education system and raised their concerns/issues to the faculty
Know more
May 2021 - Present

Co-lead, The Podcast Initiative

OVGU
Volunteered and manged a small team and conducted sessions related to jobs and hiring for students of Data Science
Know more
November 2020 - Present

Projects

This section contains awesome projects that I've developed:

Stock price prediction

Stock price prediction

Stock price prediction

Predict the stock prices based on historical stock prices of an organization and sentiment of tweets about the organization.

Python Time-series analysis Sentiment anaylsis API-handling

Data Science Market Analysis

Data Science Market Analysis

Analyze Skills and Backgrounds of Data Scientists From 1 Million Data-Related Jobs

Overall market analysis of Data science is done (skills, education, degree, gender, location) in comparison to other data related jobs.

Python NLP Text Analysis Visualisation

Linkedin Analysis

Linkedin Analysis

Linkedin connection and Meassges analysed

Analysed my Linkedin connections (their job roles, companies they work in etc) and also the messages I received (i.e sentiment and frequently used words).

Python NLP Sentiment anaylsis Text Analysis Visualisation

YOLO

YOLO

Real-Time Object Detection

Perform fast and real-time object detection on your own data on colab. YOLOv3 is extremely fast and accurate.

Python Computer Vision Object Detection Visualisation Segmentation

Medium articles anaylsis

Medium articles anaylsis

What I Learned from Scraping 15k Data Science Articles on Medium.

Analysing medium articles metadata to get insights about what factors makes an article a good medium artcle.

Python NLP Text Analysis Visualisation WordCloud

GOT characters graph

GOT characters graph

Visualising the relationship between GOT charcters

After reading the books in the series 'A Song of Ice and Fire' by G. R. R. Martin, as a true fan of Game of Thrones, you might be curious about who is the most influential person in Westeros. Or you know that Eddard Stark and Randyll Tarly are connected but not quite sure how exactly they are connected. Are they connected by a third, or fourth person?

Python Graph theory Visualisation

End to end data pipeline

End to end data pipeline

Implemented data pipeline using Spotify API, Apache Kafka, Apache Spark Streaming and Streamlit

Created an end to end data pipeline using Apache Kafka and Apache Spark Streaming. Also performed song recommendation on the songs dataset retrieved from Spotify and built a dashboard using Streamlit.

Python Spark Kafka Data Engineering API Dashboard


This portfolio is built with by Shubham Pratap Singh.