PinnedPublished inAnalytics VidhyaA Random Forest Classifier with Imbalanced DataI’m going to walk through setting up a Random Forest Classifier for multiclass classification with imbalanced data…Jul 12, 2020Jul 12, 2020
Geospatial EDA — Tableau Vs. PythonDoes Tableau fit into the exploratory data analysis portion of the data science life cycle? Or is it something to leave until the end when…Jan 23, 2021Jan 23, 2021
Published inTowards Data ScienceWho designed this database?A well designed database and up to date documentation is the ideal, but if that is not the case and you are not in the position to improve…Jan 15, 2021Jan 15, 2021
Congressional District Data Gathering and EDA pt.1With the availability of the 2020 census data fast approaching followed by the apportionment of congressional and state legislative…Jan 9, 2021Jan 9, 2021
Published inTowards Data ScienceMachine Learning — An Ethics ReviewI’m looking back at my projects from my Data Science program at Flatiron and reviewing the ethical concerns that each has…Nov 3, 2020Nov 3, 2020
Published inAnalytics VidhyaDeploy an NLP model with Streamlit and HerokuAfter finishing a recent Fake News Classification Project, I wanted to build a simple webapp that used my model.Oct 31, 20201Oct 31, 20201
Too Good to be True — NLP“It can’t be this easy.” That’s what I said to myself when getting started on my latest NLP project…Oct 25, 2020Oct 25, 2020
Published inAnalytics VidhyaPre-Processing Tweets for Sentiment AnalysisWhen doing any Natural Language Processing (NLP) you will need to pre-process your data. I will be working with a set of tweets…Sep 18, 20202Sep 18, 20202
Published inAnalytics VidhyaWaiting For the Bathroom — a Linear Regression StoryI recently completed a project involving a multivariate linear regression to predict housing prices, and guess what…people really don’t…May 20, 2020May 20, 2020
Published inAnalytics VidhyaAdjusting for Inflation When Analysing Historical Data with PythonIf you find yourself analyzing a dataset with historical monetary data like I was, you will want to adjust those values for inflation.Mar 16, 2020Mar 16, 2020