HELIX01-04 Part 01 | Sequencing Runs From Motif-based DNA Data Storage Systems
@zenodo.oai_zenodo_org_15839390
Helixworks has developed a novel DNA data storage method that encodes digital information using composite motifs: short, predefined DNA sequences assembled in specific orders. Unlike base-by-base synthesis, this approach uses enzymatic ligation of DNA building blocks (called motifs), allowing for high-throughput, cost-efficient synthesis with built-in error tolerance. Each oligo in the storage pool contains a structured arrangement of addressing barcodes, primers, and payload motifs, enabling robust multiplexing and retrieval. Information is read out via Oxford Nanopore sequencing and decoded by identifying the motifs present in each read. This architecture supports scalable, high-density molecular storage and can be integrated with automated workflows for synthesis and sequencing.

This dataset contains raw and processed outputs from a Helixworks composite motif-based DNA data storage experiment. Each archive includes:

Raw signal-level sequencing data (FAST5): Generated via Oxford Nanopore sequencing (R10.4.1 / FLO-MIN114) of synthetic oligos assembled using Helixworks’ composite motif ligation protocol.

Design file (_encoded.tsv): Lists the intended motif sequence per oligo, including barcode and primer motifs, which define the structure and payload mapping.

Zero-error motif alignment file (_master_db.txt): Captures high-confidence motif alignments per read with no substitution, insertion, or deletion errors. Each row follows the format:
[read_id] [filename] [barcode_id] [strand] [start] [end] [motif_id] [row_index] [column_index]
This file is used to validate motif calling accuracy and reconstruct encoded payloads from sequencing reads.

A detailed description of the oligo architecture, including inner address and outer barcode arrangement, primer locations, and motif layout, can be found in the linked OSF DOI.

Note: The HELIX01-04 sequencing run has been split into two parts. This archive contains Part 1 of the dataset.
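The row format given for the _master_db.txt file can be read with a few lines of Python. The following is a minimal sketch, not part of the dataset or of any Helixworks tooling. It assumes that fields are separated by whitespace, that no field contains spaces, and that [start] and [end] are integer positions; none of these details is confirmed by the description above, so check them against the actual files. The names MotifHit, parse_master_db_line, and motifs_per_read are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, NamedTuple


class MotifHit(NamedTuple):
    """One row of a _master_db.txt file (field names follow the description)."""
    read_id: str
    filename: str
    barcode_id: str
    strand: str
    start: int          # assumed to be an integer position
    end: int            # assumed to be an integer position
    motif_id: str
    row_index: str      # kept as text; the numeric type is not documented
    column_index: str   # kept as text; the numeric type is not documented


def parse_master_db_line(line: str) -> MotifHit:
    """Parse one row, assuming nine whitespace-separated fields."""
    fields = line.split()
    if len(fields) != 9:
        raise ValueError(f"expected 9 fields, got {len(fields)}: {line!r}")
    (read_id, filename, barcode_id, strand,
     start, end, motif_id, row_index, column_index) = fields
    return MotifHit(read_id, filename, barcode_id, strand,
                    int(start), int(end), motif_id, row_index, column_index)


def motifs_per_read(lines: Iterable[str]) -> Dict[str, List[MotifHit]]:
    """Group motif hits by read_id, ordered by start position within each read."""
    grouped: Dict[str, List[MotifHit]] = defaultdict(list)
    for line in lines:
        if not line.strip():
            continue  # skip blank lines
        hit = parse_master_db_line(line)
        grouped[hit.read_id].append(hit)
    for hits in grouped.values():
        hits.sort(key=lambda h: h.start)
    return dict(grouped)
```

To use it, open a _master_db.txt file and pass the file object to motifs_per_read. The per-read motif order that results can then be compared against the intended motif sequence listed in the matching _encoded.tsv design file.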
Publisher name: Helixworks Technologies
Last updated: 2026-02-20T14:15:32Z