Indian Railways Schedule-Prices-Availability Data

Comprehensive Dataset Featuring Public Trains Scraped from IRCTC in October 2023

@kaggle.bhavyarajdev_indian_railways_schedule_prices_availability_data

About this Dataset

Context

This dataset originates from a project of mine that aims to make train travel in India more accessible. The project helps travelers find and book indirect train journeys (journeys with at least one change of train), which are useful for long or remote trips where direct trains are scarce. It fills a gap left by other platforms, which show only direct trains and so may not meet every traveler's needs.
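
To make the idea of an indirect journey concrete, here is a minimal Python sketch that pairs two train legs through a shared interchange station. It is not the project's actual code: the `Leg` fields, the `one_change_journeys` helper, the layover limits, and the station and train codes are all hypothetical, and the toy legs are made up rather than taken from the dataset.

```python
from datetime import datetime, timedelta
from typing import NamedTuple


class Leg(NamedTuple):
    """One direct train segment (hypothetical schema, not the dataset's columns)."""
    train_no: str
    origin: str
    destination: str
    departs: datetime
    arrives: datetime


def one_change_journeys(legs, origin, destination,
                        min_layover=timedelta(minutes=30),
                        max_layover=timedelta(hours=6)):
    """Return (first_leg, second_leg) pairs that link origin to destination
    through one interchange station with an acceptable layover."""
    first_legs = [leg for leg in legs
                  if leg.origin == origin and leg.destination != destination]
    second_legs = [leg for leg in legs
                   if leg.destination == destination and leg.origin != origin]
    journeys = []
    for first in first_legs:
        for second in second_legs:
            # The legs must meet at the same station on different trains.
            if first.destination != second.origin or first.train_no == second.train_no:
                continue
            layover = second.departs - first.arrives
            if min_layover <= layover <= max_layover:
                journeys.append((first, second))
    # Earliest final arrival first.
    return sorted(journeys, key=lambda pair: pair[1].arrives)


# Made-up toy legs purely for illustration.
day = datetime(2023, 10, 1)
toy_legs = [
    Leg("T1", "AAA", "BBB", day.replace(hour=6), day.replace(hour=9)),
    Leg("T2", "BBB", "CCC", day.replace(hour=10), day.replace(hour=14)),
    Leg("T3", "BBB", "CCC", day.replace(hour=9, minute=10), day.replace(hour=12)),
    Leg("T4", "AAA", "DDD", day.replace(hour=7), day.replace(hour=8)),
]

for first, second in one_change_journeys(toy_legs, "AAA", "CCC"):
    print(first.train_no, "->", second.train_no, "via", first.destination)
```

In this toy data only the T1 and T2 pairing qualifies: T3 leaves the interchange ten minutes after T1 arrives, which is below the assumed 30-minute minimum layover.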

Sources

The primary source of this dataset is the Indian Railways Catering and Tourism Corporation (IRCTC) website, scraped in October 2023. The collected data includes public train schedules, pricing details, and availability information.

Project on GitHub

The Project Report in my GitHub repository explains how I analyzed the prices and the factors they depend on, and why I chose decision tree regression as the best ML model for this case. The repository contains all the code for data collection, model building, and final output generation. Users of this dataset can perform time-series analysis on the price and availability data, improve the accuracy of price prediction, and build a model for availability prediction.
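
For readers unfamiliar with decision tree regression, the following is a small pure-Python sketch of the technique: it repeatedly splits the data on the feature threshold that minimizes the squared error and predicts the mean fare in each leaf. It is an illustration only, not the repository's model, which may well use a library instead. The two features (distance in km and a numeric class code) and the fares are invented assumptions, not values from the dataset.

```python
from statistics import fmean


def sse(values):
    """Sum of squared errors around the mean (0 for an empty list)."""
    if not values:
        return 0.0
    mean = fmean(values)
    return sum((v - mean) ** 2 for v in values)


def best_split(rows, targets):
    """Return (error, feature, threshold) minimizing total SSE, or None."""
    best = None
    for f in range(len(rows[0])):
        # Skip the largest value so both sides of the split are non-empty.
        for threshold in sorted({r[f] for r in rows})[:-1]:
            left = [t for r, t in zip(rows, targets) if r[f] <= threshold]
            right = [t for r, t in zip(rows, targets) if r[f] > threshold]
            error = sse(left) + sse(right)
            if best is None or error < best[0]:
                best = (error, f, threshold)
    return best


def build_tree(rows, targets, max_depth=3, min_samples=2):
    """Grow a regression tree; a leaf is the mean target of its rows."""
    split = None
    if max_depth > 0 and len(rows) >= min_samples:
        split = best_split(rows, targets)
    if split is None or split[0] >= sse(targets):
        return fmean(targets)
    _, f, threshold = split
    left = [(r, t) for r, t in zip(rows, targets) if r[f] <= threshold]
    right = [(r, t) for r, t in zip(rows, targets) if r[f] > threshold]
    return (
        f,
        threshold,
        build_tree([r for r, _ in left], [t for _, t in left], max_depth - 1, min_samples),
        build_tree([r for r, _ in right], [t for _, t in right], max_depth - 1, min_samples),
    )


def predict(tree, row):
    """Walk from the root to a leaf and return the leaf's mean target."""
    while isinstance(tree, tuple):
        f, threshold, left, right = tree
        tree = left if row[f] <= threshold else right
    return tree


# Invented toy data: (distance_km, class_code) -> fare.
rows = [(100, 0), (100, 1), (500, 0), (500, 1)]
fares = [80.0, 300.0, 250.0, 900.0]

tree = build_tree(rows, fares)
print(predict(tree, (500, 1)))
```

A tree like this captures interactions between features (for example, distance affecting fares differently per class) without any feature scaling, which is one common reason to prefer it for tabular price data.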
