Module: Transport Data Science
2025-02-12
In other words…
Data science spin-out company: ImpactML
The Bureau of Labor Statistics in the US projects a 35% increase in data science roles in decade 2022-2032.” Source: visualisecurious.com
2017: Transport Data Science created, led by Dr Charles Fox, Computer Scientist, author of Transport Data Science book (Fox, 2018)
The focus was on databases and Bayesian methods
2019: I inherited the module, which was attended by ITS students
Summer 2019: Python code published in the module ‘repo’:
Milestone passed in my academic career, first online-only delivery of lecture (ITSLeeds?), seems to have worked, live code demo with #rstats/rstudio, recording, chat + all🎉
Thanks students for ‘attending’ + remote participation, we’ll get through this together.#coronavirus pic.twitter.com/wlAUxmZj5r— Robin Lovelace March 17, 2020
See the reading list for details
Understand the structure of transport datasets
Understand how to obtain, clean and store transport related data
Gain proficiency in command-line tools for handling large transport datasets
Produce data visualizations, static and interactive
Learn how to join together the components of transport data science into a cohesive project portfolio
The module is taught by two really well organised and enthusiastic professors, great module, the seminars, structured and unstructured learning was great and well thought out, all came together well
I wish this module was 60 credits instead of 15 because i just want more of it.
See the schedule for details
Transport modelling software products are a vital component of modern transport planning and research.
It would not be an overstatement to say that software determines the range of futures that are visible to policymakers. This makes status of transport modelling software and how it may evolve in the future important questions.
What will transport software look like? What will their capabilities be? And who will control? Answers to each of these questions will affect the future of transport systems.
4-stage model still dominates transport planning models (Boyce and Williams 2015)
Impacts the current software landscape
Dominated by a few proprietary products
Limited support community online
High degree of lock-in
Limited cross-department collaboration
Sample of transport modelling software in use by practitioners.
Software | Company/Developer | Company HQ | Licence | Citations |
---|---|---|---|---|
Visum | PTV | Germany | Proprietary | 1810 |
MATSim | TU Berlin | Germany | Open source (GPL) | 1470 |
TransCAD | Caliper | USA | Proprietary | 1360 |
SUMO | DLR | Germany | Open source (EPL) | 1310 |
Emme | INRO | Canada | Proprietary | 780 |
Cube | Citilabs | USA | Proprietary | 400 |
sDNA | Cardiff University | UK | Open source (GPL) | 170 |
Getting help is vital for leaning/improving software
“10-Hour Service Pack $2,000” (source: caliper.com/tcprice.htm)
Source: https://community.rstudio.com/about
A fundamental part of data science is being able to understand your data.
That requires visualisation, R is great for that:
Now we have data in our computer, and verified it works, we can use it
Which places are most car dependent?
How to decide?