1 Introduction

This book contains a series of mini projects to learn data science using world data from different sources.

Exploratory data analysis of total world population, regions, and countries.

Where to find world data? Many!

Primary sources:

  • International Database (IDB) from the United States Census Bureau provides population estimates and projections for 227 countries and areas.

  • World Population Prospects 2022 from the United Nations is the latest assessment considers the results of 1,758 national population censuses conducted between 1950 and 2022.

  • World Bank offers world population estimates from 1960 to 2021.

Secondary sources:

  • Gapminder foundation collects data from different resources on world population from 1800 to 2100.

  • Our World in Data brings together the most reliable and informative data sets.

Live population clocks:

R interface:

  • Gapminder package in R is a limited excerpt until 2017.

  • WDI package allows users to search and download data from over 40 datasets hosted by the World Bank.

Miscellaneous:

  • A collection of updated resources for accessing social science data on various topics from R.