Sample Multiple Linear Regression Data
Synthesised data to practice multiple linear regression techniques
@kaggle.neilhoneyman_sample_multiple_linear_regression_data
Synthesised data to practice multiple linear regression techniques
@kaggle.neilhoneyman_sample_multiple_linear_regression_data
A synthetic dataset to practice multiple linear regression techniques.
The CSV contains 200 rows of clean data and 5 variables for testing, including 2 categorical independent variables, 2 continuous independent variables, and a dependent variable.
The practice scenario is a study into the effects of certain variables on academic exam scores. The data can be analysed to identify whether there is any linear correlation between the independent and dependent variables.
This dataset is particularly useful for practising techniques from course 5 of the Google Advanced Data Analytics Specialization: Regression Analysis: Simplify Complex Data Relationships
CREATE TABLE sample_regression_data (
"study_hours" DOUBLE,
"sleep_hours" DOUBLE,
"location_type" VARCHAR,
"tutoring_access" VARCHAR,
"performance_score" DOUBLE
);
Anyone who has the link will be able to view this.