In this post we will look at the time series workflow in python, looking at confirmed Covid-19 cases at the national and state level in Mexico. We will be following the Box-Jenkins method for fitti...
Time Series Analysis of the Covid-19 pandemic in Mexico
Fine-tune Sentence Transformers Models for Science Exam Questions
In this post we will be training a sentence-transformers model, for the Kaggle - LLM Science Exam competetion. If you don’t know how sentence transformers work I recommend checking out the package ...
Topic Modelling (Part 1): Introduction to text embeddings
Topic modelling is an unsupervised document classification task, in which we assign each document or phrase in a corpus with a label, depending on the abstract properties of each corpus entry. So t...
The Origins of Formula 1: (Part 2) Bayesian Analysis
Data In this post we will build and evaluate two bayesian models to predict the probability of a driver finishing in the top 5 of a given race. The data was scrapped from the FIA’s website, check ...
The Origins of Formula 1: (Part 1) Webscraping
Formula 1 data In the modern era of the sport each car can have between 150 and 300 sensors, generating an approximate of 300gb of data per car per grand prix weekend. This is of great use for the...