09 September 2012

Blog translation

Articles on the International Education Statistics blog can now be translated into more than 60 languages, including Arabic, Chinese, French, Russian and Spanish. To translate an article, select a language from the "Translate" drop-down menu in the sidebar, above the search box.

The translations are provided by Google Translate. They are not always perfect and the formatting of articles is not always preserved. Even so, the translations may be useful for readers who prefer a language other than English.

Related articles

External links

Friedrich Huebler, 9 September 2012, Creative Commons License
Permanent URL: http://huebler.blogspot.com/2012/09/translation.html

31 August 2012

Guide to creating maps with Stata

Most charts and maps on this site were created with the Stata statistical software package. This guide explains how maps like those with adult and youth literacy rates in 2010 can be created with Stata. The article supersedes an earlier version from 2005 and introduces updated maps with current country borders. For example, South Sudan, which seceded from Sudan in 2011, is shown as a separate country on the new maps. The instructions below are for Stata version 9 or later. Users of Stata 8 are referred to the guide from 2005. The creation of maps is not supported in older versions of Stata.

Requirements

  • Stata version 9.2 or later.
  • spmap: Stata module for drawing thematic maps, by Maurizio Pisati. spmap can be installed in Stata with this command:
      ssc install spmap
  • shp2dta: Stata module for converting shapefiles to Stata format, by Kevin Crow. shp2dta can be installed in Stata with this command:
      ssc install shp2dta
  • Shapefile: A shapefile is a data format for geographic information systems. For the maps in Figures 1 and 2, please download this public domain shapefile from Natural Earth:
    110m-admin-0-countries.zip (158 KB, world map with country borders, scale 1:110,000,000)

Step 1: Convert shapefile to Stata format

  • Unzip 110m-admin-0-countries.zip to a folder that is visible to Stata. The archive contains four files:
    ne_110m_admin_0_countries.dbf
    ne_110m_admin_0_countries.prj
    ne_110m_admin_0_countries.shp
    ne_110m_admin_0_countries.shx
  • Start Stata and run this command:
      shp2dta using ne_110m_admin_0_countries, data(worlddata) coor(worldcoor) genid(id)
  • Two new files will be created: worlddata.dta (with the country names and other information) and worldcoor.dta (with the coordinates of the country boundaries).
  • If you plan to superimpose labels on a map, for example country names, run the following command instead, which adds centroid coordinates to the file worlddata.dta:
      shp2dta using ne_110m_admin_0_countries, data(worlddata) coor(worldcoor) genid(id) genc(c)
  • Please refer to the spmap documentation to learn more about labels.
  • The DBF, PRJ, SHP, and SHX files are no longer needed and can be deleted.

Step 2: Draw map with Stata

  • Open worlddata.dta in Stata.
  • For the example maps, create a variable with the length of each country's name. The Stata command for this is:
      generate length = length(ADMIN)
  • Draw a map that indicates the length of all country names with this command:
      spmap length using worldcoor.dta, id(id)
  • The default map (Figure 1) is grayscale, it shows Antarctica, there are four classes for the length of the country names, the legend is very small, and the legend values are arranged from high to low.

Figure 1: Length of country names (small scale map, default style)

Click image to enlarge.

  • A second map without Antarctica, with a blue palette, five classes, and with a bigger legend with values arranged from low to high (Figure 2) can be drawn with this command:
      spmap length using worldcoor.dta if ADMIN!="Antarctica", id(id) fcolor(Blues) clnumber(5) legend(symy(*2) symx(*2) size(*2)) legorder(lohi)
  • Darker colors on the map indicate longer country names, ranging from 4 (for example Cuba and Fiji) to 35 characters (French Southern and Antarctic Lands).
  • Please read the Stata help file for spmap to learn about the many additional options for customization of maps.

Figure 2: Length of country names (small scale map, blue palette)

Click image to enlarge.

Alternative maps with more detail

The shapefile that was used for Figures 1 and 2 was designed for small maps. It contains the borders for 177 countries and territories and does not include smaller geographic units like Hong Kong, Monaco, or St. Vincent and the Grenadines. As an alternative to the small scale map in Figures 1 and 2, Natural Earth offers shapefiles with more detail that were designed for larger maps.

  • To create the map in Figure 3, download this shapefile from Natural Earth, which has information for 242 countries and territories:
    50m-admin-0-countries.zip (1.3 MB, world map with country borders, scale 1:50,000,000)
  • Unzip 50m-admin-0-countries.zip to a folder that is visible to Stata.
  • Run this Stata command to convert the shapefile to Stata format:
      shp2dta using ne_50m_admin_0_countries, data(worlddata2) coor(worldcoor2) genid(id)
  • If you need Stata files with centroids, run this command instead:
      shp2dta using ne_50m_admin_0_countries, data(worlddata2) coor(worldcoor2) genid(id) genc(c)
  • Open worlddata2.dta in Stata.
  • Create a variable with the length of each country's name:
      generate length = length(ADMIN)
  • Draw the map in Figure 3:
      spmap length using worldcoor2.dta if ADMIN!="Antarctica", id(id) fcolor(Blues) clnumber(5) legend(symy(*2) symx(*2) size(*2)) legorder(lohi)
  • The map takes longer to draw than the map in Figures 1 and 2 because it is more detailed and shows more geographic units. The names of the countries and territories on the map have a length up to 40 characters (South Georgia and South Sandwich Islands).

Figure 3: Length of country names (medium scale map)

Click image to enlarge.

  • To create the map in Figure 4, download this shapefile from Natural Earth, which has information for 253 countries and territories, including small islands like the Ashmore and Cartier Islands:
    10m-admin-0-countries.zip (6.7 MB, world map with country borders, scale 1:10,000,000)
  • Unzip 10m-admin-0-countries.zip to a folder that is visible to Stata.
  • Run this Stata command to convert the shapefile to Stata format:
      shp2dta using ne_10m_admin_0_countries, data(worlddata3) coor(worldcoor3) genid(id)
  • If you need Stata files with centroids, run this command instead:
      shp2dta using ne_10m_admin_0_countries, data(worlddata3) coor(worldcoor3) genid(id) genc(c)
  • Open worlddata3.dta in Stata.
  • Create a variable with the length of each country's name:
      generate length = length(ADMIN)
  • Draw the map in Figure 4:
      spmap length using worldcoor3.dta if ADMIN!="Antarctica", id(id) fcolor(Blues) clnumber(5) legend(symy(*2) symx(*2) size(*2)) legorder(lohi)
  • The map takes longer to draw than the maps in Figures 1, 2 and 3 because it has the largest amount of detail. The differences between the maps in Figures 3 and 4 can be seen by clicking on the images to enlarge them. Figure 4 has more islands and more detailed shorelines. The names of the countries and territories on the map in Figure 4 have a length up to 40 characters (South Georgia and South Sandwich Islands).

Figure 4: Length of country names (large scale map)

Click image to enlarge.

Software used in this guide

  • Stata: statistical software package
  • spmap: Stata module for drawing thematic maps, by Maurizio Pisati
  • shp2dta: Stata module for converting shapefiles to Stata format, by Kevin Crow
  • 110m-admin-0-countries.zip: small scale (1:110,000,000) Natural Earth world map with country borders (158 KB)
  • 50m-admin-0-countries.zip: medium scale (1:50,000,000) Natural Earth world map with country borders (1.3 MB)
  • 10m-admin-0-countries.zip: large scale (1:10,000,000) Natural Earth world map with country borders (6.7 MB)

Related articles

External links

Friedrich Huebler, 31 August 2012 (edited 2 September 2012), Creative Commons License
Permanent URL: http://huebler.blogspot.com/2012/08/stata-maps.html

31 July 2012

61 million out-of-school children in 2010

61 million children of primary school age were out of school in 2010, down from a high of over 110 million out-of-school children in the mid-1990s, according to new estimates by the UNESCO Institute for Statistics (UIS). 32 million or 53% of the 61 million out-of-school children were girls.

The trend in out-of-school numbers since 1990 is illustrated in Figure 1. Much of the global progress over the past 15 years is due to developments in South and West Asia, where the number of out-of-school children fell from a high of 41 million in 1998 to 13 million in 2010.

Progress in sub-Saharan Africa, another region with a historically large number of out-of-school children, has been much more modest, by comparison. Here, the number of children out of school decreased from a high of 43 million in 1996 to 31 million in 2010. As a result, sub-Saharan Africa today is home to half of all out-of-school children worldwide.

In relative terms, the global out-of-school rate fell from 18% in the early 1990s to 9% in 2010, in spite of a large increase in the number of children of primary school age over the same period. However, the estimates by UIS also show that the out-of-school rate and the number of out-of-school children have stagnated over the past three years, partly because sub-Saharan Africa is struggling to increase enrolment rates in primary education while being confronted with continued strong population growth. At this rate, the world is unlikely to reach the Millennium Development Goal and Education for All goal of universal primary education by 2015.

Figure 1: Global number of out-of-school children of primary school age, by region and sex, 1990-2010
Global number of children out of school from 1990 to 2010
Source: UNESCO Institute for Statistics, July 2012. - Click image to enlarge.

The regional distribution of children in and out of school is illustrated in Figure 2. The width of each region in the graph indicates the size of the population of primary school age. The height of the bars indicates which proportion of children in each region is in or out of school.

In absolute terms, Sub-Saharan Africa has the highest number of children out of school (31 million), although its primary school-age population (132 million) is smaller than that of two other regions: South and West Asia, and East Asia and the Pacific. In relative terms, Sub-Saharan Africa also has the highest out-of-school rate of all regions. 23% of all primary school-age children have either never attended school or left school without completing primary education.

In South and West Asia, the proportion of out-of-school children (8%) is smaller than in sub-Saharan Africa and the Arab States, but in absolute terms, the region is home to 13 million out-of-school children, second only to sub-Saharan Africa. More than half of all out-of-school children in South and West Asia live in only two countries: India (2.3 million) and Pakistan (5.1 million).

East Asia and the Pacific has 168 million children of primary school age, nearly as many as South and West Asia, but only 4% are out of school. Still, due to the large primary school-age population in the region, this means that 7 million children are excluded from education.

The remaining regions have significantly fewer children out of school: Arab States (5.0 million), Latin America and the Caribbean (2.7 million), North America and Western Europe (1.3 million), Central and Eastern Europe (0.9 million), and Central Asia (0.3 million).

Figure 2: Global distribution of children of primary school age in and out of school, 2010
Distribution of children in and out of school, by region, 2010
Source: UNESCO Institute for Statistics, July 2012. - CA = Central Asia, CEE = Central and Eastern Europe, LAC = Latin America and the Caribbean, NAWE = North America and Western Europe. - Figure 2 was created with the spineplot add-on for Stata (Cox 2008). - Click image to enlarge.

Related articles

External links

Friedrich Huebler, 31 July 2012 (edited 2 August 2012), Creative Commons License
Permanent URL: http://huebler.blogspot.com/2012/04/oos.html

21 June 2012

Merger of huebler.info and huebler.blogspot.com

As announced in January, the huebler.info website was shut down in June 2012. As of 21 June, all requests for pages at huebler.info are redirected to huebler.blogspot.com.

huebler.info was mainly a mirror of articles posted on the International Education Statistics blog and nearly all content from the old huebler.info site is available at huebler.blogspot.com. Please use the blog search in the right sidebar, under "Search this site", or the blog archive to find content from the old site. The most popular pages from huebler.info, identified through access statistics collected between January and June 2012, are listed below with the corresponding huebler.blogspot.com URL.

Please replace any huebler.info links in your bookmarks and on your website by the corresponding huebler.blogspot.com links. If you are unable to find content from huebler.info with the blog search, blog archive or the list below, you can contact the author by email at fhuebler@gmail.com. I apologize for any inconvenience caused by the shutdown of the old site.

Education statistics

Stata

Related articles

Friedrich Huebler, 21 June 2012, Creative Commons License
Permanent URL: http://huebler.blogspot.com/2012/06/blog.html

31 May 2012

Adult and youth literacy in 2010

The UNESCO Institute for Statistics (UIS) released new literacy data in April 2012, with updated estimates of adult and youth literacy. In 2010, the latest year with data, 84% of the global population 15 years and older were estimated to be able to read and write (see Table 1). At the regional level, literacy rates are highest in Central Asia, Central and Eastern Europe, East Asia and the Pacific, and Latin America and the Caribbean. In these regions at least 9 out of 10 adults are literate. Literacy rates can be assumed to be as high in North America and Western Europe but not enough countries in that region collect literacy data to allow the calculation of a regional average. By contrast, adult literacy rates are significantly lower in the Arab States (75%), and in South and West Asia and sub-Saharan Africa (63% in both regions). Women are considerably less likely to be literate than men in the Arab States, East Asia and the Pacific, South and West Asia, and sub-Saharan Africa. Globally, the female adult literacy rate was estimated to be 80% in 2010, compared to a literacy rate of 89% for men. As a consequence, nearly two thirds (497 million) of the adult illiterate population in 2010 (775 million) were women.

Table 1: Adult and youth literacy rate, 2010
Region Adult literacy rate (%) Youth literacy rate (%)
Total Male Female Total Male Female
Arab States 74.7 83.3 65.7 89.1 92.4 85.6
Central Asia 99.5 99.6 99.4 99.7 99.6 99.8
Central and Eastern Europe 97.9 99.0 97.0 99.1 99.3 98.9
East Asia and the Pacific 94.2 96.7 91.6 98.8 98.9 98.7
Latin America and the Caribbean 91.4 92.1 90.7 97.2 97.0 97.4
North America and Western Europe - - - - - -
South and West Asia 62.7 74.0 51.8 80.5 86.6 74.7
Sub-Saharan Africa 62.6 71.0 54.2 71.8 76.4 66.8
World 84.1 88.6 79.7 89.6 92.2 87.1
Source: UNESCO Institute for Statistics, Data Centre, April 2012

The disparities between regions with high and low literacy rates are readily apparent from the map in Figure 1, which displays the average literacy rate in the seven Education for All (EFA) regions with data listed in Table 1. For a description of the regional groupings, please refer to a past article about the EFA regions on this website.

Figure 1: Regional adult literacy rate, 2010

Source: UNESCO Institute for Statistics, Data Centre, April 2012. - Click image to enlarge.

Youth literacy rates, for the population 15 to 24 years of age, are higher than adult literacy rates in all regions as a result of improved access to education among younger generations. Globally, 90% of all youth are able to read and write. Central Asia, Central and Eastern Europe, East Asia and the Pacific, and Latin America and the Caribbean have reached or are approaching universal literacy among their young population. The same can be assumed for North America and Western Europe, but no regional average is available from the UIS (see Table 1 and Figure 2). The disparity in literacy rates between men and women is generally smaller among the population 15 to 24 years than among the population 15 years and older. Yet, in the Arab States, South and West Asia, and sub-Saharan Africa, young women remain less likely to be able to read and write than young men. The global youth literacy rate in 2010 was 92% for men and 87% for women.

Figure 2: Regional youth literacy rate, 2010

Source: UNESCO Institute for Statistics, Data Centre, April 2012. - Click image to enlarge.

The regional averages can conceal large differences between countries within a region. This is particularly true for the adult literacy rate in the Arab States, South and West Asia, and sub-Saharan Africa (see Figure 3). In sub-Saharan Africa, for example, the adult literacy rate is below 30% in Burkina Faso and Niger - the countries with the least literate population worldwide - and above 90% in Equatorial Guinea, Seychelles, and Zimbabwe.

Figure 3: Adult literacy rate, 2010

Source: UNESCO Institute for Statistics, Data Centre, April 2012. - Click image to enlarge.

Disparities between countries within a region can also be observed for the youth literacy rate, but to a lesser degree than for the adult literacy rate (see Figure 4). Similar to the adult literacy rate, the greatest disparities exist in sub-Saharan Africa, where youth literacy rates range from 37% in Niger to 99% in the Seychelles and Zimbabwe.

Figure 4: Youth literacy rate, 2010

Source: UNESCO Institute for Statistics, Data Centre, April 2012. - Click image to enlarge.

To make it easier to explore its literacy data, the UNESCO Institute for Statistics has created an interactive visualization that combines a map showing adult and youth literacy rates, a graph with literacy rates by sex, and a scatter plot with the correlation between GDP per capita and literacy. A screenshot of the visualization is shown in Figure 5. The full visualization is available on the website of the UIS. Literacy data are also contained in the recently published World Atlas of Gender Equality in Education by UNESCO.

Figure 5: UIS data visualization with adult and youth literacy rate, 2010

Source: UNESCO Institute for Statistics, May 2012. - Click image to enlarge.

Related articles

External links

Friedrich Huebler, 31 May 2012 (edited 1 June 2012), Creative Commons License
Permanent URL: http://huebler.blogspot.com/2012/05/literacy.html