Professional Summary
Specialist Data Scientist
June 2021 - Present, Gurgaon
- Involved in deployment and delivery of Human Potential AI solutions, such as AI HR transformation POCs, AI talent selection asset development, and talent reskilling product development, for various global clients using advanced NLP techniques, AWS, and Bitbucket.
Senior Data Scientist
Sep 2020 - June 2021, Bangalore
- Worked on various verticals of continuous learning models using custom machine learning models, advanced deep learning models, advanced computer vision, Python, Docker, APIs, AWS Cloud, Postman, and Kubernetes.
Data Scientist & Analyst
July 2019 - Present, Gurgaon
- Provided end-to-end, data-driven business solutions that simplified business functions and supported organizations in planning and allocating the resources needed to attain their long- and short-term goals.
- Actively contribute to the data science community by writing articles and sharing my projects on GitHub free of cost, helping aspirants and enthusiasts with their machine learning projects and making the data science learning process smoother.
Sales & Pricing Analyst
Feb 2019 - June 2019, New Delhi
- Performed time series analysis to grow market share, achieving 200% growth in sales share; also responsible for brand management, training management, and business management of the area.
- Performed descriptive statistics and visualization on sales data gathered from various market sources to develop and implement new marketing and sales strategies, risk management, and sales forecasting.
Technical Intern
June 2016 - Aug 2016, New Delhi
- Developed a prototype while working hands-on with electrical equipment, including PLC-SCADA, relays, and circuit breakers, and played a major role in transformer oil testing with DMRC staff. The internship was a rewarding experience.
Projects
PARKINSON'S DISEASE ANALYSIS & PREDICTION USING VOICE
Project Aim: Predict whether a person has Parkinson's disease or is healthy using voice recordings from mobile phones.
Description: Collected voice recordings from different geographical regions, extracted the important features using the Parselmouth (Praat) and MoviePy packages, and built a machine learning solution for classifying the disease.
Key-skills: Data exploration, Feature engineering, Model creation, Feature selection, Model tuning and deployment.
AI SOLUTIONS FOR DISEASE CLASSIFICATION FROM CAMERA IMAGES
Project Aim: Predict various tongue, dental, and skin diseases from selfie images to catch health problems early.
Description: Collected images of diseased and healthy cases for each body part and built custom transfer learning models that classify diseases from selfie images taken in the app.
Key-skills: Data collection, Data exploration, Transfer learning, OpenCV, Docker, APIs.
Achievement: All use cases, including tongue, skin, and age prediction, went to production within 15 days.
US VISA PREDICTION AND DETAILED ANALYSIS
Project Aim: Predict visa approval and help increase the market share of OES.
Description: Addressed visa process decisions based on employee, employer, and wage data, with a detailed analysis of influential factors to increase productivity and cut training costs.
Key-skills: Machine Learning, Python, Feature Engineering, Tableau, MySQL, AWS, web scraping.
SPAM FILTER USING NLP & LSTM
Project Aim: Automate the filtering of spam from given data to improve readability.
Pipeline created: Data Collection, Feature Engineering, Feature Selection, Modeling and Deployment.
Key-skills: Deep Learning, Python, LSTM-RNN, Feature Engineering, Keras, NLP, Flask, Heroku.
ASHRAE ENERGY PREDICTION AND ANALYSIS
Project Aim: Predict energy consumption in kWh for various site IDs across around 1,450 buildings.
Description: Helped ASHRAE, which serves 132 countries around the world, estimate energy savings from retro-commissioning measures.
Key-skills: Data exploration, Feature engineering, Model creation, Feature selection, Model tuning.
COVID-19 INDIA PANDEMIC ANALYSIS & DASHBOARD
Project Aim: Automate the support process and the review analysis of Covid-19 cases.
Description: Conducted visualization and detailed analysis of Covid-19 across all states.
Key-skills: Data exploration, Python, Feature engineering, Dashboard, Filters and Storytelling.
HOUSE PRICES ANALYSIS & PREDICTION
Project Aim: Analyze the factors that influence house prices.
Description: Helped real estate markets predict property prices, which are a good indicator of both overall market conditions and a country's economic health; a house's value depends on more than location and area.
Key-skills: Data exploration, data cleaning, feature engineering basics, and model selection using the scikit-learn library.
CHATBOT: A CHATBOT THAT CAN DEFINE ME!
Built a chatbot using Rasa NLU that can hold a meaningful dialog with humans in natural, conversational language about me and my professional and personal summary. Specifically, it aims to change how resume optimization and resume shortlisting are done.
AUTO EDA TOOL USING IF-ELSE
Built a simple EDA tool with Streamlit and deployed it on Heroku to simplify and automate the exploratory data analysis process from scratch. The user only needs to provide the data in a file format such as CSV or TSV, and the tool returns the analysis output.
MARKET ANALYSIS OF WEST DELHI FOR LOW SALES
Project Aim: Helped increase sales by 13% and reduced company expenditure by 2% compared with the previous period.
Description: Lent a helping hand to OPPO Mobiles to resolve a 37% drop in sales. Collected messy, unstructured data from various market sources in order to find influential factors.
Key-skills: Data collection from various sources, Data Visualization, Team-skills, MS-Excel, Data Exploration and analysis, Time Series Analysis, Forecasting and Prescriptive analysis.
Achievement: Awarded 3rd position at the OPPO Market Research camp.
Educational Summary
Post Graduation in Data Science & Engineering
Specialization in Machine Learning, Deep Learning, Python, Tableau, MySQL, AWS, Flask
B.Tech [Electrical & Electronics]
Secured 4th rank at IIT Roorkee for RoboWar; served as Campus Ambassador of IIT Guwahati and President of the college's Event Management Committee, organizing various technical and non-technical events.
Certifications
iNeuron
Stats for Data Science by iNeuron
iNeuron Power BI & Tableau Master
iNeuron Python for Data Science
Udemy
MS Excel by Udemy
Web Scraping with Python by Udemy
Python by Udemy
LinkedIn Learning
NLP with Python for Machine Learning
DevOps for Data Scientists
Docker for Data Scientists
Great Learning
Intro to Artificial Intelligence
Data Visualization using Tableau
Excel for Beginners
Badges
Tableau Analyst Badge
Tableau Data Scientist Badge
CutShort Certified Data Science
Publications
Disclosing Natural Gradient Boosting
This algorithm incorporates uncertainty estimation into gradient boosting by using the natural gradient. It is a fast, flexible, and easy-to-use algorithm for probabilistic regression.
See Publication
RIP Pandas: Time to introduce the Vaex
An alternative to Pandas that takes less time on huge data: an out-of-core DataFrame library for Python with fast visualization. A highly viewed article on Analytics Vidhya.
See Publication
PyCaret: An Open Source AutoML Library
Automated machine learning (AutoML) involves automating the end-to-end process of applying machine learning to real-world problems, generating ML solutions for the data scientist without endless effort on the data preparation, data cleansing, model selection, and model building steps that are actually relevant in industry.
See Publication
RIP Pandas 2.0: Time For DASK !!!
An open-source, flexible library for parallel computing in Python that provides abstractions over NumPy arrays, Pandas DataFrames, and regular lists, allowing you to run operations on them in parallel.
See Publication
Natural Language Understanding
NLU is a branch of artificial intelligence that uses computer software to interpret text and other types of unstructured data.
See Publication
Teachable Machine with Google
Teachable Machine is a web-based tool that makes creating machine learning models fast, easy, and accessible to everyone. Teachable Machine is flexible: use files or capture examples live. It’s respectful of the way you work. You can even choose to use it entirely on-device, without any webcam or microphone data leaving your computer.
See Publication