Using GIS to Evaluate a
New Source of Transportation Census Data:
The American Community Survey
Wende A. O’Neill, Ph. D. & Daniel Baldwin Hess
Introduction
In a rapidly changing society, many people have come to view the decennial (ten year) census as slow and antiquated. Data taken from a census captured only once in a ten-year period is not released until a few years after the decennial year and, for many applications, the data is stale before it is time for a recount. To improve the situation, the U.S. Bureau of the Census is implementing plans as the 21st century approaches for the American Community Survey (ACS). Using refreshed multi-year accumulations, the ACS will provide timely data by sampling households nationwide every month of the year to collect housing, social, and economic data.
Transportation planners typically spend a great amount of effort adjusting decennial Census data to fit their needs. The ACS, however, provides data users with timely, comparative housing, social, and economic data throughout the decade about communities and population groups. Since continuous measurement is an untested process in the United States, transportation planners have expressed concern about the different data structure under which information will be released. This study explores the problems of geographical and data-structuring differences by comparing ACS estimates to locally-derived transportation data that planners are accustomed to using.
Importance of Census Data to Transportation Planning
State and metropolitan planning organizations rely on data from the decennial census for a broad array of applications. Data from the long-form questionnaire, which includes questions covering place of work; mode of transportation to work; carpooling; travel time and time of departure to work;vehicles available; and mobility limitations, are used for planning and modeling travel behavior. These data have been summarized for some state and local areas in the Census Transportation Planning Package (CTTP). Travel demand forecasting models and other transportation planning activities apply these data to estimate the impact of transportation and land use policy on the transportation system. The Transportation Efficiency Act for the 21st Century (TEA-21), the Clean Air Act Amendments of 1990 (CAA 1990), and the Americans with Disabilities Act all increase transportation planning requirements and related data requirements of states and metropolitan planning organizations (MPOs) (Implications of Continuous Measurement 1996).
Census transportation data can be used alone to provide descriptive analysis of work trip patterns and, when compared with previous census-year surveys, of how those work-trip patterns are changing over time. In combination with other data, journey-to-work data supports a region’s travel demand forecasting package providing input on workplace attractions, work-trip origin-destination distribution patterns, work-trip departure times, work-trip length distribution, and travel-time distribution. To smaller metropolitan areas with limited resources, the survey becomes a foundation on which to build a model system, and in larger metropolitan areas the survey is a useful data base on observed travel behavior with which to calibrate or validate the model system.
The Census Bureau anticipates greater accuracy with continuous measurement in coding and interviewing due to permanent staffing, instead of the temporary staffing associated with taking the census every ten years (Implications of Continuous Measurement 1996). Because of the nature of data collection for the American Community Survey, there is high sampling error, so the data is released with caveats. The high sampling error is countered by moving averages, in which multi-year samples are averaged. Because census data would be collected over five years, instead of on the traditional "census day," the bureau will report "rolling averages" instead of numbers fixed in time.
This study extends the previous BTS study by focusing on the impact of the ACS on data for transportation planning and policy analysis. An analytical approach is taken to document the validity of estimates from the first year of the ACS demonstration project. This is accomplished by comparing 1996 ACS estimates of household characteristics and transportation variables for Multnomah County, Oregon to data developed locally by the Portland Metropolitan Service District (Portland Metro).
Description of the Data and Development of a GIS Application
Because geographical analysis is a framework for much of the transportation planning activity of Portland Metro and is used to link various data sets, GIS has been selected as the primary methodology for this study. Zoning systems are the principal way in which spatial structure is incorporated into transportation planning models. A zone system is a set of contiguous polygons (areas defined by at least three sides). As this study is focused on evaluating estimates from the ACS, census block groups are considered to be fundamental reporting zones. Because census geography is defined independently of its attribute data, census zones are considered " artificial statistical areas," as the geography is unrelated to zone characteristics (Goodchild et al 1993). In this sense, census data models are different from those commonly used for transportation data, where the spatial basis is dependent on unique land use and transportation characteristics.
In contrast to census geography, the design of transportation zoning systems is viewed as an important geographical component of spatial analysis. Transportation Analysis Zone (TAZ) structures endeavor to create homogeneous areas of socioeconomic characteristics and tripmaking tendencies of the population. Portland Metro maintains a set of TAZs for use in planning studies. The zones in the TAZ system constitute the objects, or geographical individuals, that are the basic units for observation and measurement of spatial phenomena.. The current Portland Metro travel demand model forecasting system uses 1260 zones for the four-county area, 481 of which are Multnomah County TAZs and examined in detail for this study.
Figure 1 shows the census block group and TAZ coverages for Multnomah County. Since TAZs and census block groups have noncoterminous boundaries, this study develops spatial basis change methods to transfer socioeconomic data (developed locally at the MPO level) from its reporting zones (TAZs) to an independent set of reporting zones (census block groups). The decision to convert TAZ data to block group data was based on the ACS providing confidence intervals for its estimates. Unfortunately, the Portland Metro data set does not include this information. Only when the spatial basis of household and transportation data has been changed can a one-to-one direct comparative analysis be performed between locally derived data and census data.
Selection of Variables for Comparison
Residential trip generation rates are usually classified by household structure, household size, and auto ownership. These characteristics are important determinants of travel tendencies and are typically considered in trip rate analyses, trip generation studies, and transportation demand models. Unlike household activity surveys, which record information about trips for all travel purposes, the census only reports information about the journey to work. But the journey to work is significant because it represents about 25% of all daily trips. More work trips occur during peak travel periods and the average trip length is greater than that for other trip purposes. Destinations for the work trip are less diverse, but origin/destination data are more readily available than the origins and destinations for other trip types (Racca 1998).
This study compares the following variables that are commonly used in travel demand forecasting (Ortúzar and Willumsen 1990):
Household Characteristics Transportation Characteristics
Number of Households Means of Transportation to Work
Distribution of Household by Age of Householder Number of Vehicles Available
Number of Persons in Household Average Travel Time to Work
Household Income Time Leaving Home to go to Work
The selected variables’ definition and structure are different for ACS and Portland Metro data. Table 1 lists the eight variables chosen for comparison [in Column (1)], each variable’s strata in Portland Metro data [in Column (2)], and each variable’s strata in ACS data [in Column (3)].
The chief source of Portland Metro data is the 1994/1995 Household Activity Survey. Both temporal and categorical adjustments are necessary to compare the two data sets. First, all Portland Metro data is adjusted to its 1996 equivalent using linear interpolation based on growth factors from other sources. Next, the data are manipulated so that the strata for each variable are identical. Converting some of the variables, such as age of householder and household size requires fairly straightforward aggregation of categories. However, other variables, such as income and means of travel to work (which is the number of work trips by mode) require a great deal of manipulation to achieve similar categories. Some categories cannot be readily converted so there is also information loss in the conversion process. For example, there is no direct conversion of transit trips walk access and park and ride trips to public transportation because we do not know how many of the park and ride trips result in a carpool, vanpool, or public transportation. In this model, we assumed that anyone who carpooled identified their mode as shared ride trips and park and ride trips belong to public transportation. We could not use the variable, time leaving home to go to work because of the aggregate nature of the Portland Metro data. For the travel time to work variable, Census reports the number of households in each time interval. An average value was calculated for each block group by multiplying the mid range value by the number of households, summing then dividing by the number of households. The minimum range value was used for the last interval and the worked at home households were assigned a travel time of 0 minutes. After the adjustments have been made, spatial methods are used to interpolate Portland Metro block group level between the two data sets.
Spatial Methods
A literature review performed for this study found regional and geographic science literature dealing with the theory of spatial basis change (see Batty and Sammons 1978, Batty and Sikdar 1982, Bracken and Martin 1995, Fisher and Langford 1995, Flowerdew and Openshaw 1987, Flowerdew et al 1991, Goodchild et al 1993, Openshaw and Taylor 1981) and planning and engineering literature dealing with applications of spatial basis change (see Flood and Siaurusaitis 1997, Gan 1994, Hsiao et al 1997, Keck 1997, O’Neill et al 1992, Peng and Dueker 1993, Zhao 1999). Each allocation method has different data needs and results in errors and different levels of accuracy. The literature review and previous studies point the way toward a methodology for this study.
First, an area ratio basis change is performed as a baseline areal interpolation scenario. The results are a point of departure for comparing other methods. Second, an areal interpolation with an auxiliary variable (street length) is performed. Third, a set of control zones is designated that capture known residential settlement patterns (from land use information). Because of the rich data source from the local planning agency, two roadway networks representing different street types are used. In the third approach, the spatial basis change is enhanced by adding more information about the source zones. This approach represents an improvement over past studies. The three spatial methods used in this study are as follows.
Spatial Data Conversion Method #1: Area Ratio This spatial data conversion method assigns data from TAZs to block groups according to the ratio of land area in the overlap area to land area in the TAZ. This is the common "polygon overlay" routine provided in most GIS.
Spatial Data Conversion Method #2: Roadway Coverage This spatial data conversion method overcomes the assumption of uniform coverage of households across land area by distributing households according to roadway coverage. The built roadway network gives an indication of an area’s ability to support population and the roadway network is used an auxiliary variable for distributing households. TAZ data are converted to block group geography according to the ratio of road length in the overlap area to road length in the TAZ.
Spatial Data Conversion Method #3: Roadway Coverage with Control Zones The preceding spatial data conversion method assumes that all households are located along streets. It is possible to enhance the previous method by assigning households based on the type of road and its surrounding land use. The spatial basis change then more closely models the actual distribution of population across geographic areas. This method introduces of a third set of zones, known as control zones, that contain information about residential density and land use.
|
Table 1 Data Comparability and Estimate Strata American Community Survey and Portland Metro |
||||
|
Estimate |
Portland Metro 1994 |
ACS 1996 |
||
|
Households |
Number of households |
Number of households |
||
|
Age of Householder |
Age of head: less than 25 years Age of head: 25 - 54 years Age of head: 55 - 64 years Age of head: 65 or more years |
Age of Householder by Household Type: 15 to 64 yrs Age of Householder by Household Type: 65 yrs or more (summed for various categories of household type) |
||
|
Household Size |
Household size: 1 person Household size: 2 persons Household size: 3 persons Household size: 4 + Persons |
Persons in Household: 4 Persons
1 Person 5 Persons
2 Persons 6 Persons
3 Persons 7 + Persons
|
||
|
Vehicles Available to Household |
Average number of vehicles available per household.
|
Owner   Renter 1 vehicle 1 vehicle 2 vehicles 2 vehicles 3 vehicles 3 vehicles 4 vehicles 4 vehicles 5 or more vehicles 5 or more vehicles |
||
|
Household Income |
Household Income 1994$ less than $ 17,500 $ 17,500 - $ 28,999 $ 29,000 - $ 40,499 $ 40,500 or more
|
Adjusted Household Income 1996$ less than $5,000 $37,500 - $39,999 $5,000 - $9,999 $40,000 - $42,499 $10,000 - $12,499 $42,500 - $44,999 $12,500 - $14,999 $45,000 - $47,499 $15,000 - $17,499 $47,500 - $49,999 $17,500 - $19,999 $50,000 - $54,999 $20,000 - $22,499 $55,000 - $59,999 $22,500 - $24,999 $60,000 - $74,999 $25,000 - $27,499 $75,000 - $99,999 $27,500 - $29,999 $100,000 - $124,999 $30,000 - $32,499 $125,000 - $149,999 $32,500 - $34,999 $150,000 or more $35,000 - $37,499 | ||
|
Means of travel to work |
Total home-based work trips Drive alone trips Shared ride trips Transit trips walk access Park and ride trips Walk and bike trips |
Car, truck or van: drove alone Bicycle Car, truck or van: carpooled Walked Motorcycle Other Means Taxicab Worked at home Public transportation (bus, trolley, subway, or railroad)
| ||
|
Travel time to work |
Aggregate average travel time to work in minutes for each TAZ. |
5 min 25 to 29 min 45 to 59 min 5 to 9 min 30 to 34 min 60 to 89 min 10 to 14 min 35 to 39 min 90 or more min 15 to 19 min 40 to 44 min Worked at home 20 to 24 min | ||
|
Time leaving home to go to work |
Time leaving home to go to work (for a sample of households) for 4 superdistricts within Multnomah County. |
12:00 -4:59 am 7:00 - 7:29 am 10:00 - 10:59 am 5:00 - 5:29 am 7:30 - 7:59 am 11:00 - 11:59 am 5:30 - 5:59 am 8:00 - 8:29 am 12:00 - 3:59 pm 6:00 - 6:29 am 8:30 - 8:59 am 4:00 - 11:59 pm 6:30 - 6:59 am 9:00 - 9:59 am Worked at home |
||
Results
ACS estimates are released in the form of a direct estimate as well as a lower and upper bound representing the 90 percent confidence interval. Portland Metro interpolated estimates at the block group level are compared to the ACS 90 percent confidence interval to see whether they lie below, within, or above the confidence interval. If the interpolated estimate falls within the ACS 90 percent confidence interval, the interpolated estimate is considered to be a good fit. The results of a block-group level analysis are summed for the entire county and shown in Table 2. The columns containing the percentage of interpolated estimates that lie within the ACS 90% confidence interval are highlighted.
Some differences in the ACS estimates and Portland Metro interpolated estimates can be explained by differences in source data. Table 3 gives the aggregate counts for Multnomah County variables from the ACS (for block groups) and Portland Metro (for TAZs) prior to any spatial modeling. Both data sets encompass all of Multnomah County, so the aggregate county sums for household characteristics and transportation variables can be compared to each other. Income ranges 1 and 4, and carpool, transit, and walk/bike means of transportation to work have large aggregate differences. This is partly attributable to temporal and structural changes made to the Portland Metro data to derive comparable data sets.
Both the area ratio and roadway coverage spatial basis change methods give different estimates for household characteristics and transportation variables. Error analysis (RMSE and MAE) of the results indicate that the control zone method outperforms the roadway coverage method and the roadway coverage method outperforms the area ratio method. The control zone method produces more block group-level estimates that fall within the ACS 90 percent confidence interval, and the roadway coverage estimates deviate less from the ACS direct estimate.
With regard to spatial analysis of the findings, in general there appears to be greater differences between the estimates of the two data sets on the periphery of Multnomah County, particularly the far northwestern and far eastern sections of the county. This is consistent with general findings of spatial interpolation, which performs poorest when the street pattern is irregular and/or the two zone systems are not of similarly sized zones.
Analysis of Variance
The three methods used to perform a spatial basis change from TAZ to Block Group and the ACS direct estimates are compared using Analysis of Variance. ANOVA tests were performed for eighteen variables to compare the means of the data generated from the three spatial change methods and the ACS generated data to each other. The null hypothesis tested was:
H0: macs = m1 = m2 = m3
Where macs = mean of data from the 1996 ACS
m1 = mean of data determined from the area ratio spatial basis change method
m2 =
mean of data determined from the roadway coverage spatial basis change method, andm3 = mean of data determined from the control zone spatial basis change method.
|
Table 2: Comparison of American Community Survey (ACS) Estimates with Portland Metro Spatially Interpolated Estimates for Household Characteristics and Transportation Data Universe: All Block Groups in Multnomah County |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
County Total |
Spatial Conversion Method #1: Area Ratio Portland Metro Interpolated Estimate |
Spatial Conversion Method #2: Roadway Coverage Portland Metro Interpolated Estimate |
Spatial Conversion Method #3: Control Zones Portland Metro Interpolated Estimate |
|||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
County Total |
Census Block Groups |
County Total |
Census Block Groups |
County Total |
Census Block Groups |
|||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Characteristic |
ACS Estimate
|
Estim- ate
|
Diff- erence |
Interpolated Estimate Relationship to ACS 90% Confidence Interval |
Estim- ate
|
Diff- erence |
Interpolated Estimate Relationship to ACS 90% Confidence Interval |
Esti- mate
|
Diff- erence |
Interpolated Estimate Relationship to ACS 90% Confidence Interval |
||||||||||||||||||||||||||||||||||||||||||||||||
|
Within |
Greater Than |
Less Than |
Within |
Greater Than |
Less Than |
Within |
Greater Than |
Less Than |
||||||||||||||||||||||||||||||||||||||||||||||||||
|
Number of Households |
260,800 |
259,600 |
- 0.46 % |
44 % |
26 % |
30 % |
261,200 |
+ 0.15 % |
45 % |
28 % |
27 % |
258,300 |
- 0.96 % |
46 % |
29 % |
25 % |
||||||||||||||||||||||||||||||||||||||||||
|
Household Income Income Range 1 Income Range 2 Income Range 3 Income Range 4 |
64,900 53,000 46,200 96,700 |
79,800 57,200 43,900 77,900 |
+22.96 % +7.92 % -4.98 % -19.44 % |
82 % 83 % 87 % 87 % |
17 % 14 % 10 % 7 % |
1 % 3 % 3 % 5 % |
79,900 57,600 44,400 79,300 |
+ 23.11 % + 8.68 % - 3.90 % - 17.99 % |
82 % 91 % 94 % 97 % |
17 % 8 % 5 % 2 % |
1 % 1 % 1 % 1 % |
79,700 57,300 44,000 77,400 |
+ 22.80 % + 8.11 % - 4.76 % - 19.96 % |
81 % 91 % 95 % 97 % |
18 % 8 % 4 % 2 % |
1 % 1 % 1 % 1 % |
||||||||||||||||||||||||||||||||||||||||||
|
Age of Householder 64 years or less 65 years or more |
206,600 54,200 |
203,300 56,300 |
- 1.60 % + 3.87 % |
89 % 82 % |
7 % 9 % |
4 % 9 % |
204,700 56,500 |
+ 0.92 % + 4.24 % |
77 % 83 % |
15 % 8 % |
8 % 9 % |
202,000 56,000 |
- 2.23 % + 3.32 % |
90 % 91 % |
6 % 5 % |
4 % 4 % |
||||||||||||||||||||||||||||||||||||||||||
|
Persons in Household 1 person 2 persons 3 persons 4 or more persons |
87,300 88,600 36,700 48,200 |
82,000 85,800 39,000 52,800 |
- 6.07 % - 3.16 % + 6.27 % + 9.54 % |
50 % 65 % 78 % 90 % |
27 % 19 % 19 % 9 % |
23 % 16 % 3 % 1 % |
82,300 86,400 39,200 53,200 |
- 5.73 % - 2.48 % + 6.81 % + 10.37 % |
49 % 56 % 66 % 90 % |
29 % 25 % 26 % 8 % |
22 % 19 % 8 % 2 % |
81,600 85,300 38,800 52,600 |
- 6.53 % - 3.72 % + 5.72 % + 9.13 % |
59 % 67 % 77 % 90 % |
23 % 19 % 20 % 8 % |
18 % 14 % 3 % 2 % |
||||||||||||||||||||||||||||||||||||||||||
|
Means of Transportation to Work All Work Trips Drive Alone Carpool Transit Walk/Bike Other |
287,100 200,300 36,400 32,000 16,100 2,400 |
267,900 184,800 40,500 22,400 20,100 3,400 |
- 6.69 % - 7.74 % + 11.26 % - 30.00 % + 24.84 % + 41.67 % |
81 % 60 % 66 % 66 % 75 % - |
8 % 17 % 28 % 14 % 24 % - |
11 % 23 % 6 % 20 % 1 % - |
249,200 172,100 37,700 20,800 18,500 3,100 |
- 13.20 % - 14.08 % 0.00 % - 35.00 % + 14.91 % +29.17 % |
83 % 61 % 69 % 67 % 79 % - |
6 % 12 % 25 % 12 % 19 % - |
11 % 28 % 6 % 21 % 2 % - |
270,700 185,800 41,000 23,000 20,300 3,400 |
- 5.71 % - 7.24 % + 12.64 % - 28.13 % + 26.09 % + 41.67 % |
86 % 67 % 66 % 66 % 74 % - |
6 % 14 % 29 % 15 % 25 % - |
8 % 19 % 5 % 19 % 1 % - |
||||||||||||||||||||||||||||||||||||||||||
|
Vehicles Available (per household) |
1.53 (avg) |
1.50 (avg) |
- 1.96 % |
1.50 (avg) |
- 1.96 % |
1.54 (avg) |
+ 0.65 % |
|||||||||||||||||||||||||||||||||||||||||||||||||||
|
Travel Time to Work (minutes) |
23.25 (avg) |
24.71 (avg) |
+ 6.28 % |
|
|
|
27.20 (avg) |
+ 16.99 % |
|
|
|
25.27 (avg) |
+ 8.69 % |
|
|
|
||||||||||||||||||||||||||||||||||||||||||
Results of the ANOVA are shown in Table 4. Generally, the spatial basis change methods have the most impact on the transportation related variables. The null hypothesis is rejected for all transportation variables except vehicles available per household and two of the household income variables. For these variables, we conclude that the data resulting from the spatial conversion algorithms and the ACS data are not the same when the level of significance is 0.05.
|
Table 3 Percent Difference Between ACS and Portland Metro Estimates Prior to Spatial Basis Change |
||||
|
Multnomah County Total |
||||
|
Characteristic |
Year |
ACS Estimates for Block Groups Number of Households |
Portland Metro Estimates for TAZs Number of Households |
Percent Difference |
|
Number of Households |
1996 |
260,800 |
262,500 |
+ 0.65 % |
|
Household Income Income Range 1 Income Range 2 Income Range 3 Income Range 4 |
1996 |
64,900 53,000 46,200 96,700 |
80,200 57,800 44,600 79,800 |
+ 23.57 % + 9.06 % - 3.46 % - 17.48 % |
|
Age of Householder 64 years or less 65 years or more |
1996 |
206,600 54,200 |
205,800 56,700 |
- 0.39 % + 4.61 % |
|
Persons in Household 1 person 2 persons 3 persons 4 or more persons |
1996 |
87,300 88,600 36,700 48,200 |
82,600 86,900 39,500 53,500 |
- 5.38 % - 1.92 % + 7.63 % + 11.00 % |
|
Means of Transportation to Work All Work Trips Drive Alone Carpool Transit Walk/Bike Other |
1996 |
287,100 200,300 36,400 32,000 16,100 2,400 |
282,500 195,200 42,700 24,000 20,600 3,400 |
- 1.60 % - 2.55 % + 17.31 % + 31.88 % + 16.46 %
|
|
Vehicles Available |
1994 |
(average) 1.57 vehicles per hh |
(average) 1.56 vehicles per hh |
- 0.64 % |
|
Travel Time to Work |
1994 |
(average) 23.38 minutes |
N.A. |
- |
For each variable for which the previous null hypothesis was rejected, a separate ANOVA was run comparing only the spatial basis change methods to each other. In other words, the null hypothesis is:
H0: m1 = m2 = m3
Results from this analysis are in Table 5. The spatial basis change methods are not significantly different for the two income ranges. However, there is a significant difference among the methods for the transportation variables.
Paired T-tests between the ACS data and the data generated using method 1, areal interpolation, on household income ranges 1 and 4 result in rejecting the null hypothesis that the means are equal at a level of significance = 0.05. The same holds true comparing the ACS income data to the results from the other two spatial basis change methods. Consequently, none of the spatial basis change methods are capable of generating data similar to that collected in the ACS for household income ranges 1 and 4. However, part of this conclusion may be a function of the data conversion methods used to achieve comparable categories in each database prior to applying spatial basis change. Particularly, temporal adjustment and linear interpolation between categories may have an effect on the data.
Paired T-tests were also performed for the transportation variables to determine if any of the methods resulted in values that were similar to the ACS. For total number of work trips, the hypothesis, that the difference in the means equals zero, is rejected when comparing each method individually to the ACS. However, when comparing data from the areal interpolation method (1) and the control zone method (3) we fail to reject the hypothesis that the means are equal. Similar conclusions are reached for the drive work trips, carpool work trips, transit work trips, and walk/bike work trips. The exception is average travel time to work. In this case, none of the methods compare favorably with the ACS and the null hypothesis for the test comparing Method 1 and Method 3 is rejected.
Conclusions
From a research perspective, the big question at hand is how reliable are the household and transportation estimates of the ACS? For most household characteristics, local data falls within the ACS 90 percent confidence interval 80 to 90 percent of the time. For most transportation variables, local data falls within the ACS 90 percent confidence interval 75 to 90 percent of the time.
|
Table 4 ANOVA Results |
||||
|
Variable |
F |
p-value |
Fcrit |
Conclusion |
|
Number of households |
0.043552 |
0.987917 |
2.609099 |
Fail to reject |
|
Age of head 15-64 years old |
0.109285 |
0.954697 |
2.609099 |
Fail to reject |
|
Age of head 65+ years old |
0.392199 |
0.758634 |
2.609099 |
Fail to reject |
|
Household size – 1 person |
0.829145 |
0.477725 |
2.609099 |
Fail to reject |
|
Household size – 2 persons |
0.30456 |
0.822121 |
2.609099 |
Fail to reject |
|
Household size – 3 persons |
0.893733 |
0.443613 |
2.609099 |
Fail to reject |
|
Household size – 4+ persons |
1.656376 |
0.174417 |
2.609099 |
Fail to reject |
|
Household income – Range 1 |
10.06968 |
1.38E-06 |
2.609099 |
Reject |
|
Household income – Range 2 |
1.899782 |
0.127538 |
2.609099 |
Fail to reject |
|
Household income – Range 3 |
0.583593 |
0.625772 |
2.609099 |
Fail to reject |
|
Household income – Range 4 |
8.125438 |
2.22E-05 |
2.609099 |
Reject |
|
Total Work Trips |
129.3125 |
9.33E-77 |
2.609212 |
Reject |
|
Drive alone work trips |
93.98567 |
5.33E-57 |
2.609234 |
Reject |
|
Carpool work trips |
161.2389 |
7.45E-94 |
2.609234 |
Reject |
|
Transit work trips |
111.9429 |
4.6E-67 |
2.609298 |
Reject |
|
Walk/Bike work trips |
61.86093 |
2.63E-38 |
2.609184 |
Reject |
|
Vehicles available per HH |
1.482552 |
0.217342 |
2.609184 |
Fail to reject |
|
Travel time to work |
26.34294 |
1.01E-16 |
2.609184 |
Reject |
|
Table 5: ANOVA Comparison of Methods only |
||||
|
Variable |
F |
p-value |
Fcrit |
Conclusion |
|
Household income – Range 1 |
0.006299 |
0.993721 |
3.001389 |
Fail to reject |
|
Household income – Range 4 |
0.120231 |
0.886724 |
3.001389 |
Fail to reject |
|
Total Work Trips |
161.2782 |
2.49E-64 |
3.00156 |
Reject |
|
Drive alone work trips |
118.2347 |
1.68E-48 |
3.00156 |
Reject |
|
Carpool work trips |
186.1194 |
3.88E-73 |
3.00156 |
Reject |
|
Transit work trips |
193.0855 |
2.09E-75 |
3.001674 |
Reject |
|
Walk/Bike work trips |
63.47004 |
3.16E-27 |
3.001496 |
Reject |
|
Travel time to work |
15.02155 |
3.45E-07 |
3.001489 |
Reject |
Additional analysis reveals that the three spatial basis change methods generate comparable data to the ACS estimates for the socio-demographic data except certain income ranges. The three methods do generate data in these income ranges that are comparable with each other. None of the methods produces data comparable to each other or the ACS data for the transportation variables except average number of vehicles available to a household. However, for the remaining transportation variables, with the exception of average travel time to work, method 1 and method 3 produce comparable results by block group. Consequently areal interpolation yields similar results to the control zone method for all but one variable.
None of the methods are sufficient for reproducing transportation data similar to the ACS estimates. There are a few reasons this may occur. First, the Portland Metro data may be dissimilar to the ACS estimates given the different survey methods used. Additionally, no information on the sampling error associated with the Portland Metro survey is provided. According to Keith Lawton at Portland Metro, the activity-based survey in Portland undercounts the people who are too busy with activities to answer the CATI but there is no published information on the sources and sizes of sampling errors. Second, the transportation variables all required some type of transformation to generate similar categories for comparing ACS data to Portland Metro data. These transformations, including temporal, introduce error into the data that may be significant enough to impact the results from the spatial basis change methods. Third, none of the spatial basis change methods may capture the underlying geographical influences to adequately convert the data. For example, the number of work trips by transit is influenced more by accessibility (proximity) to transit services than the ubiquity of the street network. Also, carpool trips to work are greatly influenced by the proximity of those living near you or in your travel corridor who also work near you or in your travel corridor. None of these spatial relationships are considered by the spatial basis change methods described here.
Further consideration of the impacts on transforming variable types is certainly recommended. Unlike the geography problem, which will disappear when the journey to work coding of the ACS is performed on TAZs, this problem on characterization of data by Census and transportation planners will not change until planning models change. Consequently, planners will have to transform ACS data into the structure with which they are used to working.
References
Batty, M. And R. Sammons. 1978. On Searching For the Most Informative Spatial Pattern. Environment and Planning A. Vol. 10, pp. 747-779.
Batty, M. And P.K. Sikdar. 1982. Spatial Aggregation in Gravity Models. Environment and Planning A. Vol. 14, pp. 795-882.
Bracken, I. and D. Martin. 1995. Linkage of the 1981 and 1991 U.K. Censuses Using Surface Modelling Concepts. Environment and Planning A. vol. 27. pp. 379-390.
Caliper Corporation. 1996. Caliper TransCAD User’s Guide. Version 3.0 for use with MicroSoft Windows.
Fisher, P.F. and M. Langford. 1995. Modelling the Errors in Areal Interpolation Between Zonal Systems by Monte Carlo Simulation. Environment and Planning A. vol. 27. pp. 221-224.
Flood, Michael J. and Victor Siaurusaitis. 1997. Geographic Analysis Techniques for Use in Building or Updating a Travel Demand Model. Proceedings of the Institute of Transportation Engineers 67th Annual Meeting. 1997, Boston, Mass.
Flowerdew, R. and S. Openshaw. 1987. A Review of the Problem of Transferring Data from One Set of Areal Units to Another Incompatible Set. Northern Regional Research Laboratory, Universities of Lancaster and Newcastle-Upon-Tyne.
Flowerdew, R., M. Green, and E. Kehris. 1991. Using Areal Interpolation Methods in Geographic Information Systems. Papers in Regional Science. vol. 70. pp. 303-315.
Gan, Cheng-Tin. 1994. A GIS-Aided Procedure for Converting Census Data for Transportation Planning. ITE Journal. November, 1994. pp. 34-40.
Goodchild, M.F., L. Anselin, and U. Deichmann. 1993. A Framework for the Areal Interpretation of Socioeconomic Data. Environment and Planning A. vol. 25. pp. 383-397.
Greenshields, Bruce Douglas and Frank Mark Weida. 1978. Statistics, With Applications to Highway Traffic Analysis. Westport, Conn.: Eno Foundation for Transportation.
Griffith, Daniel. 1987. Spatial Autocorrelation: A Primer. Washington, D.C.: Association of American Geographers.
Hsiao, Shirley, Jian Lu, James Sterling, and Matthew Weatherford. 1997. Use of Geographic Information Systems for Analysis of Transit Pedestrian Access. Transportation Research Record 1604. pp. 50-59.
Implication of Continuous Measurement for the Uses of Census Data in Transportation Planning. Washington, D.C.: U.S. Department of Transportation, Bureau of Transportation Statistics.
Keck, J. Kevin. 1997. A GIS Based Census Block Group Level Transit Mode Analysis. Proceedings of the Institute of Transportation Engineers 67th Annual Meeting. 1997, Boston, Mass.
Lawton, T. Keith and Eric I. Pas. 1995. Resource Paper for Survey Methodologies Workshop. Proceedings of a Conference on Household Travel Surveys: New Concepts and Research Needs. March 12-15, 1995, Irvine, California.
O’Neill, Wende A., R. Douglas Ramsey, and JaChing Chou. 1992. Analysis of Transit Service Areas Using Geographic Information Systems. Transportation Research Record 1364. pp. 131-138.
Openshaw, S. and P.J. Taylor. 1981. The Modifiable Area Unit Problem. in Quantitative Geography: A British View. eds. Wrigley, N., R.J. Bennett. London: Routledge & Kegan Paul. pp. 60-70.
Ort_zar, Juan de Dios and Luis G. Willumsen. 1990. Modelling Transport. New York, New York: John Wiley & Sons.
Peng, Zhongren and Kenneth J. Dueker. 1993. Error and Accuracy in Spatial Data Allocation. Proceedings of GIS/LIS Conference. 1993, Minneapolis, Minn.
Racca, David P. 1998. Using GIS to Identify Markets for Transit in Delaware. Proceedings of the Geographic Information Systems for Transportation Symposium. 1998, Salt Lake City, Utah.
Wright, Tommy. 1998. Sampling and Census 2000: The Concepts. American Scientist. vol. 86. pp. 245-253.
Zhao, Fang. 1999. GIS Analysis of the Impact of Community Design on Transit Accessibility. (Working paper.) Miami, Florida: Florida International University.