If you like the dataset, please upvote it!
This dataset contains time series data for Aldi, ASDA, Morrisons, Tesco and Sainsbury's. The data is daily from 20240109 until the last update.
Here is a medium article, explaining how the data is curated: https://medium.com/@decmca21/creating-a-price-comparison-site-for-the-uks-top-5-supermarkets-with-python-github-and-streamlit-bd20b6f16ff2
Here is a medium article, explaining how the data is analysed: https://medium.com/@decmca21/create-a-price-comparison-site-for-the-uks-top-5-supermarkets-with-python-github-and-streamlit-30ed8dca4eb4
It contains all information downloaded from the respective supermarket for each day.
This includes: Product Name; Price; Price per Unit; date; category; if the product is own brand or not.
Please note that the dataset relies on a web scraper that is not perfectly reliable. The data quality can be viewed on the data quality tab here: https://uksupermarketscraping.streamlit.app. For the most part, the information is highly accurate. However, due to a glitch I did not fix initially, until 20240127 the Morrisons web scraper data quality is variable (anywhere between approx 15,000 and 25000 products are web scraped).
All of the products show the lowest price (Nectar, Clubcard etc) and any multi-buy discounts (3 for 2, meal deal etc) are excluded.
The data is only available for days on which i ran the webscraper. As this is easy to do, it is available most days, but occasionally i am travelling so it is not run.
Otherwise, the data is good :).