Project 3

Explore a large data set

There are many data sets available: planes (1, 2), taxi, Kaggle, Data world,European Social Survey, V-dem, Pew research,Food data, …, your own source. Explore the selected data set: select variables and explore them (distribution, extreme values, …), explore relations among variables (pairs, clustering, regression, derived quantities, interesting observations), and ideas for detailed analyses. Report your findings.

The selected data set has to have at least 10000 units or in the case of temporal data set the product Number of units X √(Number of time points) is at least 10000.

Before starting the analysis send me an e-mail about your selection for confirmation.

n student dataset
1 p3
2 p3
3 p3
4 p3
5 p3
6 p3
7
8
9
10
11
12
13
14
15



Students; EDA

ru/hse/eda24/stu/p3.txt · Last modified: 2024/02/19 23:48 by vlado
 
Except where otherwise noted, content on this wiki is licensed under the following license: CC Attribution-Noncommercial-Share Alike 3.0 Unported
Recent changes RSS feed Donate Powered by PHP Valid XHTML 1.0 Valid CSS Driven by DokuWiki