by lpetrich
It's been hard for me to find a good introduction to ridership-prediction models for urban transit and intercity rail lines.
However, I've found
Intercity Rail Ridership Forecasting and the Implementation of High-Speed Rail in California [eScholarship]
Alon Levy's blog entry on Sanity Checks on HSR Ridership | Pedestrian Observations
They mention a "gravity model" that goes something like this:
T(i,j) = P(i)*P(j)/C(i,j)
where T is the number trips, the P's are the origin and destination populations or places to go, and C is a cost function, often modeled as a function of distance or travel time. I've found mentions of gravity models in various other places with the help of Google Scholar.
Alon Levy mentioned that SNCF proposes that the populations be taken to some power of 0.8 to 0.9 for best fit.
I can find riders per station for some systems, and even station-to-station ridership for one system: BART - Monthly Ridership Reports. It would be difficult to test station-to-station ridership models without a lot of detail on urban geography, however. But it may be possible to test hypotheses of preferred destinations.
Also, for urban-rail systems, it may be possible to find ridership statistics over the years for some of the more recent urban-rail systems, since many of them have been tended over the years. So one could test some hypotheses about whether the number of riders is linear (if the lines radiate out from the biggest destination) or more than linear (if many riders do multi-line trips). Places like Portland OR, Sacramento, San Diego, and San Jose.
-
On the intercity side, I can't find any station-to-station statistics anywhere in Amtrak's site, though I can find numbers of riders per station and riders for some routes.
So I did something that has a certain apples-and-oranges risk: use airline ridership statistics. I used US ones from the US DoT. The gravity model didn't work very well. Instead, I found a model with a sum instead of a product of populations:
T(i,j) = (P(i)+P(j))/C(i,j)
Curiously, my estimated cost function was nearly flat as a function of distance. I note that the data points have a *lot* of scatter, and that they are bimodal. I found a lot of heavy-traffic routes and a lot of light-traffic ones, with a dividing line of about 6000 trips/direction/year.
But I found a page that suggests something about the rail cost function: Microsoft Word - Final report FINAL.doc - 2006_08_study_air_rail_competition_en.pdf The rail fraction of rail/air ridership can be approximately fit with a line from 100% rail at 1h 30m to 0% rail at 5h 30m. The halfway point is 3h 30m, and that is where the rail cost function likely passes the air one.
However, I've found
Intercity Rail Ridership Forecasting and the Implementation of High-Speed Rail in California [eScholarship]
Alon Levy's blog entry on Sanity Checks on HSR Ridership | Pedestrian Observations
They mention a "gravity model" that goes something like this:
T(i,j) = P(i)*P(j)/C(i,j)
where T is the number trips, the P's are the origin and destination populations or places to go, and C is a cost function, often modeled as a function of distance or travel time. I've found mentions of gravity models in various other places with the help of Google Scholar.
Alon Levy mentioned that SNCF proposes that the populations be taken to some power of 0.8 to 0.9 for best fit.
I can find riders per station for some systems, and even station-to-station ridership for one system: BART - Monthly Ridership Reports. It would be difficult to test station-to-station ridership models without a lot of detail on urban geography, however. But it may be possible to test hypotheses of preferred destinations.
Also, for urban-rail systems, it may be possible to find ridership statistics over the years for some of the more recent urban-rail systems, since many of them have been tended over the years. So one could test some hypotheses about whether the number of riders is linear (if the lines radiate out from the biggest destination) or more than linear (if many riders do multi-line trips). Places like Portland OR, Sacramento, San Diego, and San Jose.
-
On the intercity side, I can't find any station-to-station statistics anywhere in Amtrak's site, though I can find numbers of riders per station and riders for some routes.
So I did something that has a certain apples-and-oranges risk: use airline ridership statistics. I used US ones from the US DoT. The gravity model didn't work very well. Instead, I found a model with a sum instead of a product of populations:
T(i,j) = (P(i)+P(j))/C(i,j)
Curiously, my estimated cost function was nearly flat as a function of distance. I note that the data points have a *lot* of scatter, and that they are bimodal. I found a lot of heavy-traffic routes and a lot of light-traffic ones, with a dividing line of about 6000 trips/direction/year.
But I found a page that suggests something about the rail cost function: Microsoft Word - Final report FINAL.doc - 2006_08_study_air_rail_competition_en.pdf The rail fraction of rail/air ridership can be approximately fit with a line from 100% rail at 1h 30m to 0% rail at 5h 30m. The halfway point is 3h 30m, and that is where the rail cost function likely passes the air one.