Project 3

Explore a large data set

There are many data sets available: planes (1, 2), taxi, Kaggle, Data world,European Social Survey, V-dem, Pew research,Food data, …, your own source. Explore selected data set: select variables and explore them (distribution, extreme values, …), explore relations among variables (pairs, clustering, regression, derived quantities, interesting observations), ideas for detailed analyses. Report your findings.

For example, what is the impact of vaccination on the COVID situation in different countries?

The selected data set has to have at least 10000 units or in the case of temporal data set the product Number of units X Number of time points is at least 10000.

Before starting the analysis send me a note about your selection for confirmation.

n student dataset
1 Okpe Ikechukwu Amos Bank Turnover Dataset/Kaggle
2 Серхио Кампосортега Рендон Bank Marketing Data Set/UCI
3 Mohammad Jawad Bakhteiary Avocado
4 Ekaterina Kibalchich Monthly prices. December 2022
5 Evgenia Malinskaya Credit scores/Kaggle
6 Alisa Ignatova Food data
7
8
9
10
11
12
13
14
15



Students; EDA

ru/hse/eda22/stu/p3.txt · Last modified: 2022/12/15 14:10 by vlado
 
Except where otherwise noted, content on this wiki is licensed under the following license: CC Attribution-Noncommercial-Share Alike 3.0 Unported
Recent changes RSS feed Donate Powered by PHP Valid XHTML 1.0 Valid CSS Driven by DokuWiki