Virajdatt KohirPandas Equivalents in PysparkThis year I have been using pyspark extensively at my job. It’s been nearly 6 months and I have grown to love pyspark for it’s efficiency…Jul 13, 2023Jul 13, 2023
Virajdatt KohirTutorial on Hyper-parameter tuning of Vanilla Transformer Encoder of non-textual sequence data for…Table of Contents:May 28, 2023May 28, 2023
Virajdatt KohirData Science and Machine Learning Interview tips and questions for new grads.I recently graduated with a Masters's Degree in Data Science. For the past year, I have been preparing for interviews initially for…Dec 31, 2022Dec 31, 2022
Virajdatt KohirAirflow, writing custom operators and publishing them as a package: Part 2.Welcome back to the second part of the series of building and packaging a custom airflow operator. In the previous part, we set up our…Aug 26, 2022Aug 26, 2022
Virajdatt KohirAirflow, writing custom operators and publishing them as a package: Part 1.Welcom to this part 1 of the tutorial series on writing airflow custom operators and publishing them as a package. In this part of the…Aug 19, 2022Aug 19, 2022
Virajdatt KohirIncremental Machine Learning for Streaming data with river: Part 3; Classification Algorithms…IntroJul 24, 2022Jul 24, 2022
Virajdatt KohirIncremental Machine Learning for Streaming data with river: Part 2, Classification AlgorithmsWelcome back, this is part 2 of the “Incremental Machine Learning for Streaming data with river” series. In this present article, we will…Jun 5, 2022Jun 5, 2022
Virajdatt KohirIncremental Machine Learning for Streaming data with River: Part 1The amount of data generated, processed and analyzed every day is increasing at a tremendous pace. Modern smart devices(ex: cell phones…May 27, 2022May 27, 2022
Virajdatt KohirPandas Essentials:Dealing with categorical variables using the value_counts methodPandas is an amazing data manipulation and data wrangling library which makes an aspiring data scientist feel like a superhuman. I still…Apr 3, 2022Apr 3, 2022
Virajdatt KohirPandas, tips to deal with huge datasets!I have been using pandas for more than 4 years now and have always done all my ETL using pandas before training a machine learning model…Feb 20, 2022Feb 20, 2022