1. Data Source:
Synthetic data generated and inspired from the paper: Length-weight relationships of nine fish species from the Tetulia River, southern Bangladesh: https://www.researchgate.net/figure/Descriptive-statistics-and-estimated-length-weight-relationship-W-aL-b-W-in-g-and-L_tbl1_280916140
2. Meta Data:
species: species name
length: length (cm)
weight: weight (g)
w_l_ratio: weight / length
3. Usage:
Exploratory Data Analysis (EDA): Understand the distributions, relationships, and patterns within the data.
Classification: Predict the species based on height and length, W/L ratio
Version update note:
- Version 1 is based on the data and statistics from the paper "Length-weight relationships of nine fish species from the Tetulia River, southern Bangladesh."
- Version 2 and Version 3 have been adjusted to improve classification accuracy. These versions offer better performance due to the adjustments made for clearer differentiation between clusters.
- Feel free to explore the differences between these versions. If you're building a model based on real-life survey data, Version One is recommended. However, please note that the model performance may be less optimal since the clusters in this version tend to overlap more.
Feel free to leave comments on the discussion. I'd appreciate your upvote if you find my dataset useful! 😀