Lorenzo Taddei, Data Analyst |
Statistics and Marketing background | Tilburg, NL
About me
Welcome to my website! I am a data analyst with a recent Marketing Analytics degree from Tilburg University. I have experience working in CRM contexts and I am skilled in handling large and complex databases. My expertise also includes proficiency in using data visualization tools to effectively communicate findings and insights. I am excited to share my knowledge and experience with you, and I look forward to helping you with your data needs.
This portfolio contains a selection of analytical projects that I have worked on, showcasing my skills in data analysis and interpretation. From data visualization to statistical analysis, these projects demonstrate my ability to turn data into actionable insights.
Analytics Work
Airbnb ideal accommodation price estimation
(RStudio, Shiny App)
Airbnb perfect accommodation price estimation
The goal of this project is to build a tool that Airbnb hosts can use to understand the market price of their accommodation.
The study uses publicly available Airbnb data on Amsterdam listings. After cleaning the dataset, an analysis is carried out to identify which variables contribute the most to the price.
This feature relevance analysis is performed by examining the individual feature contributions of a random forest. The random forest takes the most relevant features in the dataset as input and attempts to estimate the listings' prices from them.
The app is deployed through the Shiny server (link below). The app, 'Airbnb perfect price', takes the accommodation's features as input and feeds these values to the random forest.
The model computes the price that a host should charge on Airbnb in order to be perfectly aligned with the competition.
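A minimal sketch of this workflow is shown below. The original project was built in R and Shiny, so this is only an illustration of the idea: the file name, feature set, and column names are hypothetical, and scikit-learn's random forest stands in for the model actually used.

```python
# Sketch: price model and feature-importance step (illustrative column names).
import pandas as pd
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import train_test_split

listings = pd.read_csv("amsterdam_listings_clean.csv")  # hypothetical cleaned dataset
features = ["accommodates", "bedrooms", "bathrooms",
            "review_scores_rating", "neighbourhood_index"]  # illustrative feature set
X, y = listings[features], listings["price"]

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

rf = RandomForestRegressor(n_estimators=500, random_state=42)
rf.fit(X_train, y_train)
print("R^2 on held-out listings:", rf.score(X_test, y_test))

# Feature relevance: impurity-based importance of each input variable
importance = pd.Series(rf.feature_importances_, index=features).sort_values(ascending=False)
print(importance)

# "Market price" estimate for a new accommodation described by its features
new_listing = pd.DataFrame([{"accommodates": 2, "bedrooms": 1, "bathrooms": 1,
                             "review_scores_rating": 95, "neighbourhood_index": 3}])
print(rf.predict(new_listing))
```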
Explore the app that allows Airbnb hosts to better understand the market and charge fair prices here:
Web page that presents the web app, its uses and limitations.
Web app version 2: helps users who already have a listing charge the perfect price
The app is built with Shiny, an R package for building interactive web apps straight from R. Shiny App course certificate.
Customer Lifetime Value in mobile gaming
(SQL, RStudio)
MSc Thesis
Latent attrition models in mobile gaming: the role of RFM values and playing-behaviour covariates in Customer Lifetime Value predictions
This study, conducted in collaboration with an international gaming company, explores new methodologies to predict customers' future spending on mobile games. The game observed is a mobile freemium game; for privacy reasons, neither the name of the game nor the data employed are published. The predictor used in the study is the Pareto NBD, a latent attrition model, which simultaneously estimates two different customer processes:
- Attrition (responsible for churn)
- Transactional (models the number of purchases)
The standard version of the Pareto NBD employs only the RFM values of the whole customer base. These values are Recency (when the last purchase took place), Frequency (how many purchases the customer made in the observed period), and Monetary (average amount of past transactions). The model performs a holdout split: it employs the RFM values only in the calibration period and computes future spending predictions for the validation period. The image below summarizes this process.
The analysis considers a cohort of customers who made their first purchase in a certain week of the year, and their transactions are observed for the whole following year. The calibration period consists of the first four months of the year, and the remaining months correspond to the validation period. As the figure shows, the predicted spending in the holdout (or validation) period is then compared with the actual data available in the dataset. This makes it possible to assess the accuracy of the model and, eventually, to claim whether including the covariates improves the predictions.
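As an illustration of this calibration/holdout workflow, the sketch below uses the Python 'lifetimes' package, which provides a Pareto/NBD implementation. The thesis itself was carried out in R, and the file name, column names, and dates here are purely illustrative.

```python
# Sketch: fit a Pareto/NBD on RFM data with a calibration/holdout split.
import pandas as pd
from lifetimes import ParetoNBDFitter
from lifetimes.utils import calibration_and_holdout_data

transactions = pd.read_csv("purchases.csv")  # hypothetical: one row per purchase

# Build RFM summaries: calibration = first four months, holdout = rest of the year
rfm = calibration_and_holdout_data(
    transactions,
    customer_id_col="customer_id",
    datetime_col="purchase_date",
    calibration_period_end="2021-04-30",   # end of calibration period (example date)
    observation_period_end="2021-12-31",   # end of validation period (example date)
)

# Fit the Pareto/NBD on calibration-period frequency/recency only
pnbd = ParetoNBDFitter(penalizer_coef=0.01)
pnbd.fit(rfm["frequency_cal"], rfm["recency_cal"], rfm["T_cal"])

# Predict the expected number of purchases per customer in the holdout period
rfm["predicted_holdout"] = pnbd.conditional_expected_number_of_purchases_up_to_time(
    rfm["duration_holdout"], rfm["frequency_cal"], rfm["recency_cal"], rfm["T_cal"]
)
print(rfm[["frequency_holdout", "predicted_holdout"]].head())
```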
Pareto NBD standard
The model's predictions can be employed in different ways. The study examines how accurate the predictions are at both the aggregate and the individual level. The observed dataset consists of 581 customers; the MAE (mean absolute error) of the Pareto NBD is 4.27, while the total residual is 0.52. This highlights the good in-sample fit of the model, which can therefore be used to evaluate the profitability of a customer base, as the image below shows.
Finally, the model is employed for a classification task: detecting top-spending customers. Here, the observed units are divided into non-top-spending and top-spending. The results yielded by the model depend on the percentage of customers to be reached; the best results are observed when identifying the top 5% of spending clients, where the hit rate (or precision) is 0.56. This means that the model can spot 56% of these most valuable clients.
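For reference, the two evaluation metrics above can be computed as sketched below, assuming per-customer actual and predicted spending in the validation period (the random numbers are only placeholders for the real data).

```python
# Sketch: MAE and top-5% hit rate from actual vs. predicted spending.
import numpy as np

def mae(actual, predicted):
    # Mean absolute error across customers
    return np.mean(np.abs(actual - predicted))

def top_share_hit_rate(actual, predicted, share=0.05):
    """Precision when flagging the top `share` of customers by predicted spending."""
    n_top = max(1, int(len(actual) * share))
    predicted_top = set(np.argsort(predicted)[-n_top:])
    actual_top = set(np.argsort(actual)[-n_top:])
    return len(predicted_top & actual_top) / n_top

# Placeholder example with random data, not the thesis results
rng = np.random.default_rng(0)
actual = rng.gamma(2.0, 10.0, size=581)
predicted = actual + rng.normal(0, 5, size=581)
print(mae(actual, predicted), top_share_hit_rate(actual, predicted, share=0.05))
```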
In terms of out-of-sample performance, the computed Pareto NBD Standard yields surprising results: the MAE is even lower when predictions are made on an external sample.
Pareto NBD with covariates
The improvement of the model when accounting for the covariates is not significant. This negative result is very likely observed because an important assumption about the covariates is not respected in this case study: the covariates must remain constant between the calibration and the validation period. Since the control variables considered include playing behaviours, such as time spent in the game or number of sessions during a week, these are very likely to change across the observed period.
Web scraping e-books from Bol.com
(Python: Selenium, Pandas)
Inspecting the e-books market via web scraping on Bol.com
The e-book market has grown significantly in recent years, with more and more readers opting for digital versions of their favorite books. The advantages of e-books include their accessibility, portability, and low costs. From the authors' point of view, e-books provide opportunities to reach a wider audience and generate higher profits due to the lack of printing costs. Given the potential of the e-book market, it is essential to stay up to date on market trends and customer preferences.
The goal of this project is to provide authors and publishers with up-to-date information about e-books. More specifically, I built a web scraper that gathers key information on all e-books for kids available on Bol.com, extracting data such as price, topics, number of pages, languages, and other relevant details, which writers can then use to create e-books in the ideal format.
To accomplish this, I used a web scraping tool to navigate the website and extract the desired information from the e-book listings on Bol.com. The screenshot below shows the information available on the main navigation page of kids' e-books, from which various insights can already be extracted.
To obtain the required data, the scraper carried out the following tasks (sketched in the code below):
- Navigate through all the pages of the category (500 pages)
- Collect the links (the seed) of all the e-books on each page
- Visit every link in the seed and extract all the information of interest
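The sketch below illustrates these three steps with Selenium and pandas, the stack used in the project; the category URL and CSS selectors are illustrative, not the real ones from Bol.com.

```python
# Sketch: collect e-book links page by page, then scrape each product page.
import pandas as pd
from selenium import webdriver
from selenium.webdriver.common.by import By

driver = webdriver.Chrome()
base_url = "https://www.bol.com/nl/nl/l/kinderboeken-ebooks/"  # hypothetical category URL

# Steps 1 and 2: navigate the category pages and collect the product links (the "seed")
seed = []
for page in range(1, 501):
    driver.get(f"{base_url}?page={page}")
    for link in driver.find_elements(By.CSS_SELECTOR, "a.product-title"):  # illustrative selector
        seed.append(link.get_attribute("href"))

# Step 3: visit every link in the seed and extract the information of interest
records = []
for url in seed:
    driver.get(url)
    records.append({
        "url": url,
        "title": driver.find_element(By.CSS_SELECTOR, "h1").text,
        "price": driver.find_element(By.CSS_SELECTOR, "span.promo-price").text,  # illustrative
    })

driver.quit()
pd.DataFrame(records).to_csv("bol_kids_ebooks.csv", index=False)
```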
The information gathered in this project was used by a writer to create an illustrated e-book for kids, aligned with market trends and customer preferences. The collected dataset may also help identify areas of opportunity for creating e-books that fill gaps in the market.
The following repository contains the source files used for extracting the data and the final dataset generated: Web scraping from Bol.com
Multiple Correspondence Analysis on chess games
(Excel, RStudio)
BSc Thesis
Multiple Correspondence Analysis on chess games
This project aimed to understand the underlying patterns and correlations among various features of real-life chess games. The study collected data from a large number of publicly available past chess games (on chessgames.com), including the opening sequence, information on the moves made, pieces captured, the players involved, and the outcome of the game.
The main focus of the thesis is an MCA (multiple correspondence analysis), a statistical technique that allows the identification of latent variables from a set of observed features. In this study, the collected data on chess games were used to define new latent variables that cannot otherwise be observed. The results of the analysis revealed two main latent variables: "level of dynamism" and "level of complexity". The level of dynamism refers to the degree of movement and activity in the game, while the level of complexity refers to the number of strategic options available to the players.
These two variables help describe which categories of each variable affect the dynamism and complexity of the games. Since the outcome of the analysis refers to the classes of the variables, overall conclusions can be drawn. For instance, games played in recent years have a higher level of complexity, and the less time players are given, the more dynamic the game will be. Despite the descriptive nature of MCA, the thesis also presents a model that can be used to measure any chess game along these two variables, making it possible to quantify and compare the dynamism and complexity of different games. This model can be applied to analyze any chess game and understand its underlying characteristics, making it useful for coaches, players, and researchers.
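As an illustration of this kind of analysis, the sketch below runs an MCA with the Python 'prince' package; the original thesis was carried out in R, and the dataset, variable names, and categories here are hypothetical.

```python
# Sketch: MCA on categorical chess-game features (illustrative variable names).
import pandas as pd
import prince

games = pd.read_csv("chess_games.csv")  # hypothetical dataset of game-level features
cat_vars = ["opening_family", "time_control", "result", "decade", "captures_level"]
X = games[cat_vars].astype(str)  # MCA expects categorical (string) columns

mca = prince.MCA(n_components=2, random_state=42)
mca = mca.fit(X)

# Row coordinates: each game scored on the two latent dimensions
# (interpreted in the thesis as "dynamism" and "complexity")
game_scores = mca.row_coordinates(X)

# Column coordinates: how each category of each variable loads on the dimensions
category_loadings = mca.column_coordinates(X)
print(category_loadings.sort_values(0).head())
```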
Overall, this thesis makes a significant contribution to the field of chess research by providing new insights into the dynamics and complexity of chess games through the use of statistical methods. The findings of this study have the potential to improve the understanding and analysis of chess games and can be applied to improve the performance of coaches and players.
Dashboards created from open source datasets
(Power BI, Tableau)
Data visualization
Dashboards and data storytelling with Tableau and Power BI.
As a data analyst, I have a passion for turning complex data into meaningful insights through compelling visualizations. I am experienced in creating dashboards using both Tableau and Power BI, two of the leading data visualization tools in the industry.
Tableau
I started learning Tableau through online courses (certifications at the end of the page). Below you can see examples of visualizations I created; more dashboards can be viewed on my Tableau profile. Feel free to check out my Tableau Public Profile.
Power BI
In addition to Tableau, I am also proficient in Power BI, part of the Microsoft suite. These are screenshots of dashboards I created for anonymized companies, analyzing their sales data and customer behavior. If you'd like to see more of my Power BI work, download this zipped file containing some of my creations.
Overall, my data visualization skills allow me to tell a story with data and help clients make informed decisions based on their data. Whether it's through Tableau or Power BI, I am able to turn data into a visually appealing and easily understandable format.
The Impact of a Failed Acquisition: study case of Kraft Heinz and Unilever
(Excel, SPSS)
Case study
A Comprehensive Study of the Effects of a Failed Acquisition: The Kraft Heinz and Unilever Case.
The focus of the project is to analyze the effect of a failed acquisition in the stock market. In 2017, Kraft Heinz attempted to acquire Unilever, but after rumors of the takeover, Unilever rejected the offer. In this study, the historical stock prices of both companies were observed from 2016 until after the rejection of the offer to estimate the economic consequences of the event.
To estimate the magnitude of the event, the study started from Yahoo Finance data to extract all the necessary information about the two companies around the event date. The analysis was performed in SPSS to gain insights into the stock returns generated by the event. The results show that the event had a more positive impact on Unilever's abnormal stock returns (the discrepancy between observed returns and the expected rate), which supports investors' decision to reject the offer from Kraft Heinz. The chart below shows the stock returns before and after the event for both companies. This folder contains further insights to better picture the situation of the two companies before and after the event.
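The sketch below shows one simple way to compute abnormal returns around the event from Yahoo Finance-style daily prices, using a mean-adjusted expected return. The original analysis was done in SPSS and may have used a different expected-return model; the file name, ticker, dates, and window lengths here are examples only.

```python
# Sketch: abnormal returns around the event (mean-adjusted expected return).
import pandas as pd

def daily_returns(csv_path):
    # Daily simple returns from a Yahoo Finance-style price export
    prices = pd.read_csv(csv_path, parse_dates=["Date"], index_col="Date").sort_index()
    return prices["Adj Close"].pct_change().dropna()

unilever = daily_returns("UL.csv")        # hypothetical export from Yahoo Finance
event_date = pd.Timestamp("2017-02-17")   # approximate day the takeover news broke

# Expected return estimated as the mean daily return over a pre-event window
estimation = unilever.loc["2016-01-01":"2017-01-31"]
expected = estimation.mean()

# Abnormal return = observed return minus expected return, around the event
event_window = unilever.loc[event_date - pd.Timedelta(days=5):
                            event_date + pd.Timedelta(days=5)]
abnormal = event_window - expected
print(abnormal.cumsum())  # cumulative abnormal return over the event window
```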
In addition, the use of Google Trends data provided a comprehensive view of the event and its impact on the public's interest. This information complemented the stock market analysis, providing a more complete understanding of the consequences of the failed acquisition.
In conclusion, the study provides valuable insights into the impact of a failed acquisition on stock returns. The results were statistically significant and novel, as little research has been done on this topic in the past. The study highlights the importance of considering the consequences of such events and their impact on the stock market, as well as the public's interest in and perception of the companies involved.
Work in Progress
The projects displayed on the website are a selection of my most significant work and are intended to showcase my skills and field of expertise. Additional projects will be posted soon.
Note that the projects in this portfolio do not include the full code or complete information, as they may be confidential. Please contact me if you want to know more about them.