Transit smart card data mining for passenger origin information extraction
- 10 October 2012
- journal article
- Published by Zhejiang University Press in Journal of Zhejiang University SCIENCE C
- Vol. 13 (10), 750-760
- https://doi.org/10.1631/jzus.c12a0049
Abstract
The automated fare collection (AFC) system, also known as the transit smart card (SC) system, has gained more and more popularity among transit agencies worldwide. Compared with the conventional manual fare collection system, an AFC system has its inherent advantages in low labor cost and high efficiency for fare collection and transaction data archival. Although it is possible to collect highly valuable data from transit SC transactions, substantial efforts and methodologies are needed for extracting such data because most AFC systems are not initially designed for data collection. This is true especially for the Beijing AFC system, where a passenger’s boarding stop (origin) on a flat-rate bus is not recorded on the check-in scan. To extract passengers’ origin data from recorded SC transaction information, a Markov chain based Bayesian decision tree algorithm is developed in this study. Using the time invariance property of the Markov chain, the algorithm is further optimized and simplified to have a linear computational complexity. This algorithm is verified with transit vehicles equipped with global positioning system (GPS) data loggers. Our verification results demonstrated that the proposed algorithm is effective in extracting transit passengers’ origin information from SC transactions with a relatively high accuracy. Such transit origin data are highly valuable for transit system planning and route optimization.Keywords
This publication has 18 references indexed in Scilit:
- Transit Stop-Level Origin–Destination Estimation through Use of Transit Schedule and Automated Data Collection SystemTransportation Research Record: Journal of the Transportation Research Board, 2011
- Travel Time and Transfer Analysis Using Transit Smart Card DataTransportation Research Record: Journal of the Transportation Research Board, 2010
- Markov models for Bayesian analysis about transit route origin–destination matricesTransportation Research Part B: Methodological, 2009
- Use of Entry-Only Automatic Fare Collection Data to Estimate Linked Transit Trips in New York CityTransportation Research Record: Journal of the Transportation Research Board, 2009
- Enriching Archived Smart Card Transaction Data for Transit Demand ModelingTransportation Research Record: Journal of the Transportation Research Board, 2008
- Constructing an Automated Bus Origin–Destination Matrix Using Farecard and Global Positioning System Data in São Paulo, BrazilTransportation Research Record: Journal of the Transportation Research Board, 2008
- Integrating Bayesian networks and decision trees in a sequential rule-based transportation modelEuropean Journal of Operational Research, 2006
- Origin and Destination Estimation in New York City with Automated Fare System DataTransportation Research Record: Journal of the Transportation Research Board, 2002
- The computational complexity of probabilistic inference using bayesian belief networksArtificial Intelligence, 1990
- LII. An essay towards solving a problem in the doctrine of chances. By the late Rev. Mr. Bayes, F. R. S. communicated by Mr. Price, in a letter to John Canton, A. M. F. R. SPhilosophical Transactions of the Royal Society of London, 1763