TRANSJAKARTA - Public Transportation - Transaction Data
When a data analyst want to build the framework for the analysis they should not have waited for the real transaction to fill in from time to time. They could try to create a dummy data for testing whether the framework or the data structure already meet the requirement for deep analytics. Here i tried to simulate transaction data for Transjakarta as i found none on the Internet that is publicly shared. Hope you can exercise with this data i simulate to make it more meaningful as the master data from this data are real (but with dummy transactions).
The master datas are sourced from: https://ppid.transjakarta.co.id/pusat-data/data-terbuka/transjakarta-gtfs-feed
The data was generated using Python using Faker and Random based on master datas. The source might be updated from time to time and the dataset might not represent the latest version from the source.
Context:
Transjakarta is public transportation company from Indonesia, based in Jakarta. The transportation modes are big bus (BRT), medium and big bus (non-BRT), mini bus (Mikrotrans). The mechanism in Transjakarta is to Tap-In and Tap-Out using payment card as your tickets.
Content:
Basically this data is simulation for Transaction data in Transjakarta. It does not represent the real data / structure used in Transjakarta
Inspiration:
Transjakarta is growing as public transportation company. But none have shared data for the transaction analysis. We can analyze which route are busy and not. Which route is heavy with traffic jam or not. And other dimension provided you can analyze.
*If you'd like to see how i created this dataset you can peek the process in my GitHub