Xu Wang 1 Lina Yao 2 Fu Xiao 3 Jianbo Li 4 Xin Miao 1 and Zheng Yang 1 1 School of Software Tsinghua University 2 School of Computer Science and Engineering The University of New South ID: 813176
Download The PPT/PDF document "Jing Chu 1 , Kun Qian 1" is the property of its rightful owner. Permission is granted to download and print the materials on this web site for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.
Slide1
Jing Chu1, Kun Qian1, Xu Wang1, Lina Yao2, Fu Xiao3, Jianbo Li4, Xin Miao1 and Zheng Yang11School of Software, Tsinghua University2School of Computer Science and Engineering, The University of New South Wales3School of Computer Science, Nanjing University of Posts and Telecommunications4College of Computer Science and Techonology, Qingdao UniversityPresenter: Jing Chu
Passenger Demand Prediction with Cellular Footprints
1
Slide2Motivation2Imbalance Passenger Demand PredictionPassenger Waiting TimeDriver Profit
Slide3MotivationCellular DataUser LocationMobility ModellingData Traffic EngineeringUser Portrait Research
Can get
crowd related data
compared with
the data from online car-hailing platform
Related WorkData sparsity problemLack of crowd flow analysisTarget to the number of pick-ups, ignoring potential passengers who eventually give up taking taxisImproper handling of spatial relationsFixed grid regionFail to capture the complex spatial dependencyReflect potential passenger demandUsing deep learning model to predict citywide passenger demand with
cellular data on flexible region partition accurately
Slide5Challenges
Slide6System OverviewDatabase server module Pre-processing moduleDeep learning moduleVisualization and evaluation module
Slide7System OverviewDatabase server module: Stores the big cellular data and provides retrival and aggregation services for fast preprocessingPre-processing moduleDeep learning moduleVisualization and evaluation module
Slide8Data SourceCellular data: Collected by a major cellular carrier of China Weather data: From Dark Sky API290 Billion Cellular Records1.5 Million Covered UsersDec 5th, 2016-Feb 4th 20178000 Cell TowersShenyang City, ChinaWeather StatesTemperature
Wind SpeedVisibility
Record: Unique
anonymized user ID,
create
time,
cell
tower ID, App ID, URL
Slide9System OverviewDatabase server modulePre-processing module: Passenger information extraction and flexible region partition Deep learning moduleVisualization and evaluation module
Slide10Passenger Demand Extraction Intercepting analysis of network data packets: Using HTTP proxy tool to understand the meaning behind URLs of DiDi ChuxingExtract passenger demand from cellular data
Slide11Region PartitionPartition the city by primary road network from OpenStreetMapFiner-grained partition: Partition the city by the main secondaryroads if the passenger demand of any block is too high
Slide12Crowd Outflow ExtractionCrowd outflow: The number of people leaving the regionOutflowStartOther regionsA region
Slide13System OverviewDatabase server modulePre-processing moduleDeep learning module: Deep learning architecture FlowFlexDP to model and predict the passenger demand for each regionVisualization and evaluation module
Slide14Deep Learning Architecture FlowFlexDPPredict the passenger demand for each region
Slide15Graph Convolutional Neural NetworkModel spatial dependency: Graph Convolutional Neural NetworkSpectral Graph Theory The normalized graph LaplacianConvolution theoremPolynomial filter: Consider k-order neighbors
Slide16GCNN with Residual LearningCapture the long-distance spatial dependency: Apply residual learning to GCNNResidual learningThe residual unit of GCNN
Slide17Sequences FusionPassenger demand = Hourly + DailyClosenessPeriodicityCrowd Outflow Sequences FusionCross Correlation CoefficientOffice RegionResidential Region
Slide18External Factors FusionWeather & Time MetadataHolidaySnowy DayPredicted PassengerDemandWeatherΣRegressionPassenger Demand
Crowd Outflow
Time Metadata
Parametric-matrix Based Fusion
Slide19System OverviewDatabase server modulePre-processing moduleDeep learning moduleVisualization and evaluation module
Slide20Experimental SettingTraining datasetThe first 65% for training data, the second 10% for validation set and the rest 25% for testingSpatio-temporal sequence: Min-Max normalization to [0,1]External factors: One-hot coding for day-of-week, time-of-day, holidays, weather state; Min-Max normalization for the temperature, wind speed and visibilityThe length of hourly sequence: {3, 4, 5, 6, 7, 8}The length of daily sequence: {1, 2, 3, 4, 5, 6, 7, 8}
Slide21EvaluationBaselinesHA: The Historical Average modelARIMA: The Autoregressive Integrated Moving AverageSARIMA: The Seasonal Autoregressive Integrated Moving Average modelVAR: Vector Auto-RegressiveLSTM: Long-Short Term Memory is a Recurrent Neural Network architectureANN: The Artificial Neural NetworkMetric: RMSE
Slide22ResultsPerformance EvaluationResGCNN: Combines GCNN and residual learningResGCNN-D: Further adds the passenger demand daily sequenceResGCNN-DO: Further fuses the crowd outflow hourly and daily sequencesFlowFlexDP: Our final modelAll variants achieve better performance than the baseline by at least 12.99%
Slide23VisualizationOur prediction results are very close to the real state
Slide24ConclusionWe propose a deep learning model FlowFlexDP, which usesGCNN and residual learning for predicting passenger demandon flexible region partitionEvaluation results on a large-scale real data set show that our model outperforms existing modelsWe demonstrate cellular data as a rich data source forpassenger demand prediction, which has been largelyoverlooked and unexplored previouslyIt is the first work that uses crowd flow information fromcellular data for passenger demand prediction
Slide2525Thanks Q&A