CEPHIA HIV Recency Assay Data
Global Assay Results and Participant Characteristics
@kaggle.thedevastator_cephia_hiv_recency_assay_data
The Consortium for the Evaluation and Performance of HIV Incidence Assays (CEPHIA) works to deepen understanding of HIV epidemiology around the world. By collecting and testing specimens through collaborations across the globe, it monitors the accuracy and precision of HIV recency assays. This dataset contains assay results together with the corresponding participant characteristics, so researchers can study both incidence rates and long-term dynamics in cohorts from numerous countries.
The dataset provides key information such as assay type, specimen type, testing laboratory, participant demographics (e.g., sex and country), HIV status at the visit, HIV status at cohort entry, elite controller status over time, and antiretroviral treatment history (both current ART and the first treatment episode). It also includes viral load results with related fields, such as the offset between the closest viral load measurement and the visit date and the sensitivity level, along with the EDDI interval size and the number of days since EDDI. Together these variables let you probe questions ranging from how incidence changes over time by population or country, and how infection levels might be reduced in especially vulnerable communities, to how other factors such as wealth- or gender-based disparities interact among those affected by the virus.
This dataset provides a comprehensive look at CEPHIA collaborations across the world to evaluate the accuracy of HIV recency assays. It contains information on assay results and participant characteristics such as HIV status, HIV subtype, country of origin and demographics. The data can be used to gain insight into global trends in HIV incidence and dynamics.
To get started with this dataset, explore the different columns available to you such as assay, cephia_panel, testing_laboratory, etc. These will give an indication of what kind of assay was used, where it was conducted and what samples were tested. Then look at the other columns which provide more detailed information about each participant such as their HIV subtype, HIV status at visit and visit date.
Once you are familiar with the column names, select only the columns relevant to your analysis. There is no need to include columns that do not add value to it, and dropping them reduces clutter and makes the data much easier to analyse.
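As a minimal sketch of that column-selection step, the snippet below reads a CSV and keeps only a chosen subset of columns. The helper name `load_columns` and the `WANTED` list are illustrative choices, not part of the dataset, and the two sample rows are made-up placeholders so the example runs without the real file.

```python
import csv
import io

# Columns to keep; the names come from the dataset's column list.
WANTED = ["assay", "assay_result_value", "country", "sex", "days_since_eddi"]


def load_columns(csv_file, wanted=WANTED):
    """Read a CSV file object and keep only the wanted columns of each row."""
    reader = csv.DictReader(csv_file)
    missing = [c for c in wanted if c not in (reader.fieldnames or [])]
    if missing:
        raise KeyError(f"columns not found in file: {missing}")
    # csv yields every value as a string; convert types later as needed.
    return [{c: row[c] for c in wanted} for row in reader]


# Made-up placeholder rows, NOT real CEPHIA data.
SAMPLE = io.StringIO(
    "assay,cephia_panel,assay_result_value,country,sex,days_since_eddi\n"
    "AssayA,PanelX,1.5,CountryA,F,120\n"
    "AssayB,PanelX,0.4,CountryB,M,\n"
)

rows = load_columns(SAMPLE)
print(rows[0])
```

To run it against the real file, pass an open file handle instead, for example `open("cephia_public_use_dataset_20210604.csv", newline="")`. Empty strings, as in the second sample row, stand for missing values here; how the real file encodes missing values is an assumption worth checking.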
Finally, if you have questions or would like further explanation of any aspect of this dataset, refer to CEPHIA's website or contact them directly.
- Using the HIV subtype and HIV treatment information, researchers can develop and evaluate models that predict treatment effectiveness for different types of HIV.
- Examining the viral load closest to a certain visit date, as well as the viral load type used, allows researchers to better understand the dynamics of viral load within cohorts.
- Analyzing which participants were designated as elite controllers at each visit can help characterize and track periods in which a person controls their infection without medication, and lets investigators study how this occurs in patient populations that respond differently to medication.
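The viral load and elite controller ideas above could be combined along the following lines. This is only a sketch under stated assumptions: that each row is one assay result, so a visit can appear on several rows and should be counted once via `visit_identifier`; that booleans arrive as the strings "True"/"False"; and that missing viral loads are empty strings. The function name and the sample rows are made up for illustration.

```python
import statistics
from collections import defaultdict


def median_viral_load_by_group(rows, group_col="ever_designated_as_elite_controller"):
    """Median viral load per group, counting each visit at most once."""
    seen_visits = set()
    groups = defaultdict(list)
    for row in rows:
        visit = row["visit_identifier"]
        raw = row["viral_load_closest_to_visit"]
        # Skip repeated visits and rows without a viral load (assumed "").
        if visit in seen_visits or raw == "":
            continue
        seen_visits.add(visit)
        groups[row[group_col]].append(float(raw))
    return {group: statistics.median(values) for group, values in groups.items()}


def make_row(visit, elite, viral_load):
    """Build one made-up row with only the three columns used here."""
    return {
        "visit_identifier": visit,
        "ever_designated_as_elite_controller": elite,
        "viral_load_closest_to_visit": viral_load,
    }


# Made-up placeholder rows, NOT real CEPHIA data.
rows = [
    make_row("1", "True", "40"),
    make_row("1", "True", "40"),  # same visit, second assay result
    make_row("2", "True", "60"),
    make_row("3", "False", "10000"),
    make_row("4", "False", "50000"),
    make_row("5", "False", ""),  # no viral load recorded
]

medians = median_viral_load_by_group(rows)
print(medians)
```

Passing a different `group_col`, such as `designated_as_elite_controller_at_visit` or `viral_load_type`, gives the same summary along another dimension.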
If you use this dataset in your research, please credit the original authors.
Data Source
License: CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
No Copyright - You can copy, modify, distribute and perform the work, even for commercial purposes, all without asking permission. See Other Information.
File: cephia_public_use_dataset_20210604.csv
| Column name | Description |
|---|---|
| assay | The type of assay used to test the specimen. (String) |
| cephia_panel | The CEPHIA panel used for testing the specimen. (String) |
| testing_laboratory | The laboratory performing the testing procedures. (String) |
| test_date | The date the specimen was tested. (Date) |
| assay_result_field | The field containing the results obtained from the assay. (String) |
| assay_result_value | The value of the results obtained from the assay. (Numeric) |
| assay_result_method | The method used to obtain the results from the assay. (String) |
| specimen_type | The type of specimen used for the assay. (String) |
| hiv_status_at_visit | The HIV status of the participant at the time of the visit. (String) |
| cohort_entry_hiv_status | The HIV status of the participant at the time of cohort entry. (String) |
| days_since_cohort_entry | The number of days since the participant entered the cohort. (Numeric) |
| hiv_subtype | The HIV subtype of the participant. (String) |
| hiv_subtype_confirmed | Whether the HIV subtype of the participant has been confirmed. (Boolean) |
| country | The country the participant is from. (String) |
| sex | The sex of the participant. (String) |
| age_in_years | The age of the participant in years. (Numeric) |
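| eddi_interval_size | The size of the EDDI (estimated date of detectable infection) interval. (Numeric) |
| days_since_eddi | The number of days since the EDDI (estimated date of detectable infection). (Numeric) |
| days_since_ep_ddi | The number of days since the EP-DDI (earliest plausible date of detectable infection). (Numeric) |
| days_since_lp_ddi | The number of days since the LP-DDI (latest plausible date of detectable infection). (Numeric) |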
| designated_as_elite_controller_at_visit | Whether the participant was designated as an elite controller at the time of the visit. (Boolean) |
| ever_designated_as_elite_controller | Whether the participant has ever been designated as an elite controller. (Boolean) |
CREATE TABLE cephia_public_use_dataset_20210604 (
"assay" VARCHAR,
"cephia_panel" VARCHAR,
"testing_laboratory" VARCHAR,
"test_date" TIMESTAMP,
"assay_result_field" VARCHAR,
"assay_result_value" DOUBLE,
"assay_result_method" VARCHAR,
"specific_result_identifier" BIGINT,
"generic_result_identifier" BIGINT,
"participant_identifier" BIGINT,
"visit_identifier" BIGINT,
"specimen_type" VARCHAR,
"hiv_status_at_visit" VARCHAR,
"cohort_entry_hiv_status" VARCHAR,
"days_since_cohort_entry" BIGINT,
"hiv_subtype" VARCHAR,
"hiv_subtype_confirmed" BOOLEAN,
"country" VARCHAR,
"sex" VARCHAR,
"age_in_years" DOUBLE,
"eddi_interval_size" DOUBLE,
"days_since_eddi" DOUBLE,
"days_since_ep_ddi" DOUBLE,
"days_since_lp_ddi" DOUBLE,
"designated_as_elite_controller_at_visit" BOOLEAN,
"ever_designated_as_elite_controller" BOOLEAN,
"treatment_naive_at_visit" BOOLEAN,
"on_treatment_at_visit" BOOLEAN,
"first_treatment_episode" VARCHAR,
"days_since_first_art" DOUBLE,
"days_since_current_art" DOUBLE,
"days_from_eddi_to_first_art" DOUBLE,
"days_from_eddi_to_current_art" DOUBLE,
"viral_load_closest_to_visit" DOUBLE,
"viral_load_date_offset_from_visit_date" DOUBLE,
"viral_load_type" VARCHAR,
"viral_load_detectable" VARCHAR,
"cd4_count_at_visit" DOUBLE
);
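A table with this schema can be summarised with an ordinary aggregate query. The sketch below uses Python's built-in `sqlite3` as a stand-in engine (an assumption: the source does not say which database hosts the table), declares only four of the columns, and inserts made-up placeholder rows so that it runs on its own.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# A four-column subset of the schema; SQLite accepts these type names.
conn.execute(
    """
    CREATE TABLE cephia_public_use_dataset_20210604 (
        "assay" VARCHAR,
        "assay_result_field" VARCHAR,
        "assay_result_value" DOUBLE,
        "country" VARCHAR
    )
    """
)

# Made-up placeholder rows, NOT real CEPHIA data.
conn.executemany(
    "INSERT INTO cephia_public_use_dataset_20210604 VALUES (?, ?, ?, ?)",
    [
        ("AssayA", "result", 1.0, "CountryA"),
        ("AssayA", "result", 3.0, "CountryB"),
        ("AssayB", "result", 0.5, "CountryA"),
    ],
)

# Count results and average the value per assay and result field.
summary = conn.execute(
    """
    SELECT "assay", "assay_result_field",
           COUNT(*) AS n_results,
           AVG("assay_result_value") AS mean_value
    FROM cephia_public_use_dataset_20210604
    GROUP BY "assay", "assay_result_field"
    ORDER BY "assay", "assay_result_field"
    """
).fetchall()
print(summary)
```

Grouping by `assay_result_field` as well as `assay` keeps different result fields of the same assay from being averaged together.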