80 likes | 183 Views
SJTU CMGPD 2012 Methodological Lecture. Day 10 Geographic Variables. Geographic Identifiers. REGION Based on dataset North, Central, South-Central, South DISTRICT Typically based on administrative district recorded in register before village name
E N D
SJTU CMGPD 2012Methodological Lecture Day 10 Geographic Variables
Geographic Identifiers • REGION • Based on dataset • North, Central, South-Central, South • DISTRICT • Typically based on administrative district recorded in register before village name • Corresponds roughly but not exactly to contemporary shi. • UNIQUE_VILLAGE_ID • Based on village name recorded in registers • Followed by consolidation to account for variations
Geographic identifiers • Differences by REGION and DISTRICT can be substantial, and it is generally a good idea to include one of them as a control. • UNIQUE_VILLAGE_ID is more useful as a control variable
UNIQUE_VILLAGE_ID • Do not use Guosantun (DATASET == 10) 1780 or Aerjishan (DATASET == 25) 1906 in any analysis involving village of residence. • Address codes in these two registers were not entered properly • Will be corrected in a future update • Many villages have more than one descent group, and many descent groups are spread over more than one village
Restricted Data • Names of individuals • Names of villages • Physical locations of villages • For about 90% of the population • Latitudes and longitudes that can be linked • Requires signing a contract with ICPSR • To prevent inappropriate use of the data • For anyone interested in genealogy, Genealogical Society of Utah already has the names