|
Harshit Agarwal
I'm a Post Graduate student at IIIT Bengaluru.
Through my coursework and projects,
I have gained practical experience in cleaning and analyzing datasets,
applying statistical and analytical techniques to uncover patterns,
and presenting insights in a clear and effective way.
Previously, I was responsible for managing and running my family business,
where I handled client negotiations, built strong relationships,
and ensured smooth day-to-day operations to manage and grow the business.
Email /
CV /
Github /
Leetcode /
Kaggle
|
|
Visualizations
All visualizations made using Tableau.
|
|
|
Electric Vehicle Data Analysis Visualization
Tableau
This is workbook of Tableau, visualizing the dataset of Electric Vehicle Population Data.
It was designed by keeping various KPI Requirements such as
Average Electric Range, Total BEV Vechicles Relative to
Total Vehicles, Total PHEV Vehicles Relative to Total vehicles etc.
Charts like Line/Area chart, Map Chart, Pie/Donut Chart, Bar Chart etc were used in Visualization and finally a Dashboard was made for Data Discovery.
|
Data Analysis
Projects based on analysing Datasets, mostly comprising of EDA, Feature Engineering processes for the Data.
|
|
|
Insurance Pricing forecastor Using XGBoost
Numpy, Jupyter Notebook, Pandas, Matplotlib, XGBoost
Built a XGBoost Regression Model to that helps establish the rates of premium by predicting the charges or payouts done by the firm.
Achieved a total of 15-20% improvement in RMSE over baseline models such as Linear Regression. The approach for the
project was 1.) Exploratory Data Analysis(EDA) 2.) Build and Evaluate baseline linear model 3.) Improve on the baseline linear model with Data Preprocessing.
4.) Improve the model training process using Sklearn's Pipeline and compare the results of final model using RMSE Error Values.
Other processes involed in the stage of developing this project was Understanding Correlation between the Categorical and Target Variables and various methods used in Correlation Analysis.
Implementing BayesSearchCV for XGBoost Hyperparameter Optimization
|
|
|
Uber Data Analysis
Numpy, Jupyter Notebook, Pandas, EDA, Matplotlib
This project analyse the Uber dataset provided in Kaggle.
The following KPI's were investigated for the best outcome of the Dataset -
1.) ARPU - Average Revenue Per User
2.) Usage Frequency per Month
3.) Monthly Active Users (MAU)
4.) Retention Rate and etc...
|
|
|
Titanic - Machine Learning from Disaster - Kaggle Competition
Python, Numpy, Pandas, Matplotlib, Machine Learning
This project analyse the Titanic - Machine Learning from Disaster dataset provided in Kaggle.
|
|
|
Qiskit Practice Notebooks
Qiskit, Quantum Circuits, Jupyter Notebook, Python
Implementing Quantum Ciruits
|
|
|
Amazon ML Challenge 2025
Machine Learning, Jupyter Notebook, Python, Transformers
Ranked 794 amongst 20,000+ Participated Teams
Problem Description - In e-commerce, determining the optimal price point for products is crucial for marketplace success and customer satisfaction.
Develop an ML solution that analyzes product details and predict the price of the product.
(The relationship between product attributes and pricing is complex - with factors like brand, specifications, product quantity directly influence pricing.)
We used TF-IDF Vectorization for text data
and LightGBM Regression on numerical features extracted from the catalog content.
|
|
|
Wunder Fund RNN Challenge
Machine Learning, Python, LSTM
Current Rank of 261 amongst 2300+ Participants.
Problem Description - In this competition, you are invited to build a model that predicts the next market state from a sequence of prior states. This is a very challenging endeavor due to the complexity of the data: many standard time-series statistical assumptions are not met here.
Yet the problem is feasible — Wunder Fund's 10 years of successful trading prove it.
The task mirrors problems quantitative researchers face daily.
You'll have to be smart — in the HFT domain, inference needs to be made under very tight time constraints, so any practical solution must be nimble enough to run on CPU.
Ongoing Competition
|
|
Articles
|
Under Construction
|
|
Papers
|
Under Construction
|
|