INTRODUCTION
With the explosive growth of data in the present day, nonvolatile memory based on semiconductors has reached the limits of Moore’s Law, and new media are necessary to store unbelievably large amounts of data (
1–
4). DNA has the potential to become the next-generation information storage medium due to its high storage density (about 455 Exabytes/g) and long-term storage time without electrical maintenance (
5–
9). The general process of existing DNA-based data storage typically includes data writing, storage, indexing, modifying, and reading. Among them, high-throughput DNA synthesis (data writing) and high-throughput DNA sequencing (data reading) technologies benefit from the rapid development of biochemical research and have already been commercialized semi-automatically on instruments (
4,
10). Although DNA synthesis costs are still very high, it is expected to be used on a large scale in the next decade (
11). However, no existing DNA data storage device has yet achieved the fundamental storage functionalities and robust data management capabilities comparable to commercial storage systems, which will be essential when that time arrives.
Advanced storage devices have strong expansibility and versatility and share a key feature: They enable data interactions at a single physical location through optical, electrical, or magnetic mechanisms, facilitating efficient data management and manipulation by supporting file storage, retrieval, repeated access, deletion, and replacement. As DNA data storage technology advances, DNA is frequently combined with material substrates to enhance data retention duration (
12–
15), reduce polymerase chain reaction (PCR) bias (
12,
16,
17), simplify operational processes (
18–
22), and provide indexing capabilities (
12,
16). These are typically prepared in particle form to provide physical partitions that distinguish between files. However, they are different in material composition, structural form, and fabrication methods, which makes integration different. Furthermore, establishing the storage system and addressing target files and repeated operations on these files require constant switching between manual and instrumental processes, which is labor-intensive and results in a notable waste of consumables (table S5). There has been exciting progress in introducing microfluidic chips to integrate the process of DNA data storage, such as DNA synthesis and sequencing (
23), DNA retrieval process (
24–
26), and DNA encapsulation/decapsulation process (
8,
9); however, the versatility of channel-based microfluidic chips is poor due to their complex channel designs and extensive pump-valve structures, face challenges in integrating additional functions and processes. Digital microfluidic chips require locating the target DNA file on a glass substrate and manually installing it during extraction (
25), which leads to poor scalability for large-scale data processes. In addition, some materials [e.g., polydimethylsiloxane (
26) and glass (
24)] are used to prepare microfluidic chips due to their excellent processing properties. The low space utilization of substrate materials leads to low DNA load (table S5). Thus, a material system has not been developed that realizes the entire DNA storage process and enables compact data storage with completely automated file operations.
Furthermore, although DNA data storage is traditionally considered suitable for cold data storage (
7,
12,
16,
27), a long-standing challenge is to design a platform that enables rapidly loading DNA files, high-speed file addressing, multiple files recovery, multiple files replacement, and the long-term stability of DNA data and could even handle warm data and serve as a personal big data storage hub. Here, we propose a DNA-based storage device with a form factor similar to that of a magnetic tape, named “DNA cassette tape” (
Fig. 1A). DNA cassette tape (DNA tape) is composed of a polyester-nylon composite membrane with excellent mechanical properties and has a large number of hydrophilic and hydrophobic spaces to form barcode patterns. The barcodes are continuously arranged on DNA tape by laser inkjet printing, generating 5.45 × 10
5 addressable data partitions per 1000-m tape and achieving an addressing rate of 1570 partitions/s. In addition, the deposit-many-recover-many (DMRM) feature can independently operate on each data partition of DNA tape. Zeolitic imidazolate frameworks (ZIFs) as a protective layer can be synthesized on DNA tape in situ for long-term data storage, realize rapid encapsulation within 10 s, and decapsulation within 10 min on DNA tape. Last, we developed a compact DNA cassette tape drive for DNA tape (
Fig. 1B), which can perform file addressing, decapsulation, encapsulation, recovery, removal, and redeposition operations on DNA files quickly and automatically.
DISCUSSION
In this work, we established a DNA-based data storage cassette tape model, which has the following advantages and disadvantages:
First, the rolled configuration of the DNA tape efficiently maximizes the spatial utilization of the material, enabling portability and extending the number of available areas and storage capacity by increasing its length. By transforming a two-dimensional plane into a three-dimensional structure, DNA tape allows continuous extension in one direction while maintaining a compact volume, enhancing the density of available area per unit volume. In addition, by incorporating barcode patterns without expanding the overall volume of the roll, DNA tape achieves addressable physical partitions with a density of 364 partitions/cm
3. This partition density is 30 times higher than that of the digital microfluidic (DMF) chip (table S5 and text S1) despite each partition on the DNA tape using ~4 mm
2, which is four to five times the area of each spot (a radius of ~1 mm) deposited on the DMF chip. However, the partition density on DNA tape is lower than that of current particle-based storage (table S5). Nevertheless, when attempting to store and manipulate 10
5 files, particle-based storage technology requires semi-automated operation within 615 liters of physical space (table S5); DMF chips can achieve the same within 22.92 liters of physical space, although this requires manual handling to move the glass slides and assemble them into DMF chips (table S5). In contrast, DNA cassette tape system can operate fully automatically within a physical space of 6.85 liters. In addition, using more complex barcodes on the DNA tape (such as code-49 or code-16k), akin to adding more tracks on magnetic tape, or creating more precise physical partitions with addressing patterns [e.g., using lithography technology (
50)], could further increase partition density. However, this would require more advanced optical recognition systems or higher control resolution “heads” than this work for file addressing and manipulation.
Second, the DNA tape allows for convenient integration of functionalities and devices on the tape path, and through motorized operation, it can perform specific tasks in designated regions. One example is DNA deposition, which can be integrated into a tape system. Each file deposition process is faster (takes ~1 min for a single file, the theoretical limit achieves 2.3 files/s in continuous conduction mode) within an acceptable loss compared to some DNA deposition methods of solvent evaporation (
24), electrostatic interaction (
12,
15,
51), and coupling (
8,
16) (fig. S24), and the tape rotation allows for the continuous deposition of DNA files. While inkjet methods have been used for DNA synthesis and deposition on two-dimensional chips (
24,
52), increasing data storage capacity by simply raising dot density is challenging. Expanding data capacity without altering dot density typically requires enlarging the chip size. DNA tape overcomes this by converting the two-dimensional surface into a three-dimensional structure, allowing for infinite extension in one direction while maintaining a compact volume, thus linearly increasing data storage capacity and shortening the deposition time.
Another example is the ability to perform various biochemical processes, such as DNA deposition, DMRM, removal, and encapsulation/decapsulation, with high precision in a small, confined space by incorporating micro-reaction chambers. Unlike current microfluidic-based technologies for DNA storage (
8,
9,
24–
26), which require complex channel designs, micro-fabrication, and chip assembly to manipulate the liquid into a specific solid-phase region for subsequent mixing, reactions, or achieving specific functions. In contrast, our approach allows for manipulating the solid phase into the liquid for reaction by using a motor to rotate and a simple mechanical “head” to move up and down. This setup immerses the tape into the liquid and uses hydrophobic barriers on the tape to ensure limited diffusion of the liquid on the tape. This setup enables these enzymatic reactions (file removal) and polymerization (file redeposition) to be performed in designated solid regions and the generation (encapsulation) and removal of micro-nanostructures (decapsulation) and is universal for other liquid-solid operations. In our current setup, although the “head” can only remove and replace data in one partition at a time, the linear encoding method allows us to demonstrate the continuous operation of the “head” to process multiple partitions, enabling precise text editing within files (text S5).
In addition, for long-term use of the tape, the DNA files on the tape will only remain readable if the indexing system and the tape are at least as long-term chemically stable as the DNA. We observed that the barcode patterns and nylon layers did not detach or disappear (text S5), and the remaining stress of the tape was more than 99% after accelerated aging (text S8). Thus, it enables the long-term loading of DNA files. Another concern for the long-term use of the tape is wear and breakage. Although the surface roughness of a specific area on the tape decreased after 100 wear cycles, there was no notably detachment of the encapsulation layer (text S8). However, for high-speed file addressing and frequent file access, a more wear-resistant coating may be needed to ensure that the encapsulation layer continues to protect the DNA. Moreover, the DNA tape exhibits a comparable maximum force for fracture to commercial data storage tapes (Dell LTO Ultrium 3) (text S8). Even in the event of accidental breakage, the DNA tape can be rejoined using transparent adhesive tape to maintain the mechanical integrity, barcode readability, and file accessibility required for continued operation (text S8). For long-term storage, the tape casing should be made of oxidation- and corrosion-resistant materials such as titanium or stainless steel to prevent disintegration of the casing.
Our approach not only enhances operational flexibility but also supports the scalability of storage systems by allowing modular integration of additional components for different functions. For example, adding a barcode reader can easily introduce the file addressing system to the cassette tape system. The file addressing rate on DNA tape (1570 files/s, using a 10-character barcode) rivals Boolean search methods (
12) (1000 files/s with multiple copies per file) and achieves a smaller storage space than desktop-level DNA data storage through non-oligonucleotide synthesis methods (table S5). If a physical partition is filled with data (2 mm by 5 mm by 0.159 mm by 3.23 by 10
12 bits/mm
3 = 5.14 × 10
12 bits, corresponding to 0.59 TB), the rate of file addressing can reach about 917 TB/s (0.59 TB × 1570 files/s) and ~ 9 × 10
5 times faster than the Boolean search method (~1 GB/s). In addition, the most advanced barcode reader based on the laser can achieve faster recognition speed (which supports 148 symbols of code-128, 1300 barcodes/s, barcode reader BL-1300 series, Keyence), which can theoretically reach a file addressing speed at 577,200 files/s (details in text S2). It is important to note that the file addressing rate refers to the time required to locate the physical position of the DNA file, excluding subsequent DNA recovery, amplification, and sequencing operations. Thus, when reading data in parallel, the file retrieval time on the DNA tape needs to be cumulatively increased. This may necessitate adding more “heads” to increase the throughput of file recovery. Because of the aforementioned characteristics, different functional modules can be easily integrated into a compact device, creating a fully automatic DNA cassette tape drive.
Last, DNA cassette tape offers several cost-saving advantages in data addressing and multiple recoveries (tables S2 and S4) and serves as a scalable and customizable compact memory solution. The barcode information embedded within the DNA tape is flexible, allowing for the customization of files stored on the tape. This flexibility enables a collection of DNA cassette tapes to be mapped to different customers with unique catalogs, making it an adaptable storage system for diverse user needs. Furthermore, we developed a compact DNA tape drive to achieve a fully automated closed-loop operation for DNA files, including deposition, addressing, recovery, repeated deposition, and recovery, excluding DNA synthesis and sequencing. Nonetheless, it is essential to acknowledge that the large-scale, low-cost synthesis of oligo pools and their integration into storage systems remains a substantial barrier to the practical implementation of DNA data storage. Practical solutions for large-scale input of oligo pools into DNA cassette tape systems may require adaptation to novel DNA data-generation technologies, development of customized pipetting systems, or in situ synthesis of DNA sequences to increase the bandwidth of molecular data transportation to improve the bottleneck of DNA input (text S9). Encouragingly, we have validated the feasibility of phosphoramidite-based DNA synthesis and sequencing-by-synthesis on the tape (fig. S25). Because of the modular nature of DNA tape, our next step is to implement a portable, end-to-end DNA data storage device.
MATERIALS AND METHODS
Materials and characterization
Nylon C membranes were obtained from PALL Corporation. EDC and sodium bromide were obtained from Sigma-Aldrich (Shanghai, China). Sodium hydroxide (NaOH), acetic acid, sodium bicarbonate (NaHCO3), zinc nitrate hexahydrate [Zn(NO3)2.6H2O], citric acid, and disodium hydrogen phosphate (Na2HPO4) were obtained from Aladdin (Shanghai, China). SDS and 2-methylimidazole were obtained from Macklin (Shanghai, China). A 1 × tri(hydroxymethyl)aminomethane (Tris)-ethylene diamine tetraactic acid (EDTA) buffer solution (TE, pH 7.4) was obtained from Bioroyee (Beijing, China). A 20× SSPE buffer solution (pH 7.4) was obtained from OKA (Beijing, China). Ultrapure water was obtained by an ultrapure water system (PURELAB Ultra FLC00006307, ELGA). All DNA oligo pools were synthesized by Twist Bioscience. All primers and gene handles were obtained from Sangon Biotech (Guangzhou, China). TaKaRa LA Taq DNA polymerase and deoxynucleotide triphosphate (dNTP) Mixture were obtained from Takara Biotech (Beijing, China). DNase I and Bst 2.0 WarmStart DNA polymerase were obtained from New England Lab. QIAquick Gel Extraction Kit was obtained from QIAGEN (Shanghai, China). PureLink Quick PCR Purification Kit, Qubit dsDNA HS, and BR Assay Kit were obtained from Thermo Fisher Scientific (Shanghai, China). PowerUp SYBR Green was obtained from Applied Biosystem (Shanghai, China). Amino-terminated polyethylene glycol was obtained from Chongqing Yusi Pharmaceutical Technology Co. Ltd. (Chongqing, China).
The scanning electron microscope images were taken using SU8220 (Hitachi, Japan). The contact angle images were taken using a contact angle tester (DSA25, KRUSS, Germany). The atomic force microscope (AFM) images were taken using Dimension Icon (Bruker, Germany). The optical images were taken using a smartphone (iPhone 14 Pro, Apple, China). The XPS data were taken using K-Alpha (Thermo Fisher Scientific, USA). Mechanical tensile test is performed on a universal testing machine (CMT5205, SUST, China).
Generating the DNA file sequences
A text file (0.11 KB, the SUSTech Address; fig. S19) and four image files (51.6, 24.8, 46.9, 33.3, and KB, the lantern image; fig. S19) were encoded using DNA Fountain software with the following parameters: --m 3 --gc 0.05 --rs 5 --delta 0.05. These parameters ensure that the generated sequences contain up to three contiguous identical bases, a CG content of about 50 ± 5%, and 5 B of RS code combined with 16 B of data information within each oligo. DNA file sequences were generated with a length of 100 nt (text file, 41 oligos, four image files, 5010 oligos, 2476 oligos, 4476 oligos, and 4391 oligos) first, added a 4-nt restriction endonuclease restriction recognition sequence at the 3′ end, and added 20-nt adapter sequences at both ends for DNA file deposition, recovery, amplification, and qPCR. The primer sequences are shown in table S3. The text file was synthesized by column-based chemical synthesis (Sangong, China), and image files were synthesized by inkjet printing technology (Twist, USA).
DNA sequencing and decoding
After obtaining the solution after DNA file recovery, the solution was adjusted to neutrality by acetic acid solution, followed by PCR (a 25-μl system, 11.5 μl of H2O, 1 μl of template, 4 μl of 100 μM forward primer, 4 μl of 100 μM reverse primer, 2.5 μl of reaction buffer, 2.5 μl of dNTP, and 0.25 μl of LA Taq DNA polymerase) to amplify the file sequence, and the PCR procedure was as follows: (i) 95°C for 2 min, (ii) 95°C for 30 s, (iii) 60°C for 20 s, (iv) 72°C for 15 s, (ii to iv) repeated 30 times. After amplification, files were sequenced on the DNBSEQ-T7 platform (MGI, Shenzhen, China). The raw sequencing data were trimmed with cutadapt using the following parameters: --discard-untrimmed -m 21. Reads were aligned to template sequences using Burrows-Wheeler Aligner with default parameters, and the quality of the reads ≥ Q30 was used for analysis and decoding. The processed sequencing results were decoded for DNA decoding using the DNA Fountain software.
Quantification of the DNA
Synthetic DNA pools were first resuspended to 10 ng per μl in water and kept at 4°C until further use. We used qPCR (Thermo Fisher QuantStudio 7) to quantify DNA to draw standard curves using the synthetic DNA pools as templates. First, the synthesized DNA pools were diluted 100 times as PCR templates, and the procedure was performed as described previously. Next, the amplified DNA was measured using the Qubit Assay Kit (Invitrogen) and used as a known concentration. The known concentration DNA solution was diluted in a gradient, and a 50-μl qPCR system was configured (13 μl of H2O, 1 μl of the sample, 8 μl of 1 μM forward primer, 8 μl of 1 μM reverse primer, and 20 μl of PowerUp SYBR Green Master Mix). qPCR was performed with the following thermal protocol: (i) 50°C for 2 min, (ii) 95°C for 2 min, (iii) 95°C for 15 s, (iv) 60°C for 15 s, (v) 72°C for 60 s, (iii to iv) repeated 30 times. The unknown DNA concentrations were obtained using qPCR, and the cycle threshold (Ct) values obtained were brought into the fitted standard curve to obtain the unknown concentration.
Fabrication of the DNA cassette tape
The nylon C membranes were cut to 20 cm by 15 cm with a paper cutter (deli) and taped onto A4 printing paper using transparent tape. Next, print the preset barcode pattern onto the nylon membranes using an inkjet printer (HP, M281fdw). A mixture of PDMS and the curing agent (10:1) was prepared and applied evenly to the bar area of the barcode by a syringe and completing the preparation of the hydrophobic barrier on the DNA tape. Next, the nylon membranes with the barcode pattern were cut into 20 cm by 0.5 cm strips by a paper cutter and attached first and last using double-sided adhesive tape (3M) to form an extra-long tape. The gene handle solution was prepared by mixing 100 μM gene handle solution with 750 mM NaHCO3 in a 1:2 ratio and introduced into the space area of the DNA tape for 10 min, which had been immersed in 16% (w/v) fresh EDC solution for 10 min. The DNA tape was placed at 4°C until it is used.
To prepare encapsulated DNA tape, DNA tape was first immersed in the 2-Melm solution (700 mM), and then an equal volume of zinc nitrate solution (70 mM) was introduced into the system and allowed to wait for 10 min. After three washes, the encapsulated DNA tape was obtained. For decapsulating on DNA tape, 0.2 M NaHPO4 solution and 0.1 M citric acid solution were mixed in the ratio of 8.8:11.2 to obtain a buffer (pH = 4.5). The encapsulated DNA tape was immersed in the buffer for 15 s, washed three times with water to remove the protective layer of ZIFs, and used for subsequent experiments. The DNA tape cassette was custom-made by computer numerical control machining.
DNA synthesis and sequencing on DNA tape
To realize DNA synthesis using the phosphoramidite chemistry on DNA tape, we first immersed the tape in 16% (w/v) EDC for 10 min and reacted with amino-terminated polyethylene glycol solution (with 750 mM NaHCO3 in a 1:2 ratio) for 1 hour. After modifying the surface of the DNA tape with hydroxyl groups, the DNA tape was cut into 2 mm by 2 mm pieces and placed into DNA synthesis columns (DS0025, Biocomma Biotechnology Co. Ltd., Shenzhen, China), and used a DNA synthesizer (YB-192S, Yibo Biotechnology Co. Ltd., Shanghai, China) to synthesize 40-nt oligonucleotides (5-ATAAATGACCTGCCGTGCAATGGCTCATTTCACAATCGGT-3) following standard DNA synthesis protocols. Each cycle of the synthesis process included four steps: deblocking, coupling, capping, and oxidation. The deblocking reagent used was 5% (v/v) trifluoroacetic acid (TFA) in acetonitrile. The oxidation reagent consisted of 0.1 M iodine dissolved in a 9:1 pyridine/acetic acid solution (v/v). The coupling reagent was 0.5 M tetrazole in acetonitrile. The capping reagents were obtained from Suresyn Co. Ltd. (Shenzhen, China). Each synthesis cycle includes deblocking (120 s), washing (120 s), coupling A/T/C/G (480 s), washing (120 s), capping (80 s), washing (120 s), oxidizing (80 s), and washing (120 s) steps. Subsequently, we performed in situ PCR amplification on the synthesized DNA. The primers used for the PCR are listed in table S3.
File addressing rate test
We configured the DNA tape with 10 consecutive barcodes containing identical data characters. In addition, we established groups of barcodes with 2 (a0-a9), 4 (ab01-ab09), 6 (abc001-abc009), 8 (abcd0001-abcd0009), and 10 (abcde00001-abcde00009) data characters. By setting different rotational speeds within the range of 0 to 2000 rpm, we obtained sampling results via the terminal and calculated the accuracy of barcode reading. Each experiment was repeated three times, and the calculation formula is as follows
File deposition on DNA tape
To deposit DNA data on the DNA tape, the configured DNA file solution (1 μl, consisting of synthetic DNA file (0.2 ng/μl), 0.05 mM dNTP, 8 U of Bst 2.0 WarmStart DNA polymerase, 10× reaction buffer, and 6 mM MgSO4) was introduced through a syringe onto the physical partitions of the DNA tape and reacted at 60°C, 80% RH for 1 min to complete the deposition process of the DNA file.
Multiple file recovery and redeposition on DNA tape
The target file to be recovered from the DNA tape was immersed in 20 μl of NaOH (0.15 M) for 5 min, and the alkaline liquid was collected to complete the file recovery operation. The collected alkaline liquid was adjusted to neutral by adding 5 μl of 1 × TE (pH = 7.4) and acetic acid solution and then sequencing and decoding to complete the file recovery. The area which had been read on the DNA tape could be restored to the initial dsDNA state by adding recovery solution (A 50-μl system, including 1 μl of 10 μM primer, 1 μl of 2.5 mM dNTP, 1 μl of Bst 2.0 WarmStart DNA polymerase, 5 μl of reaction buffer, 3 μl of MgSO4, and 39 μl of H2O). After each step, the tape was washed five times with 2× SSPE/0.1% SDS solution and 1× TE buffer solution, respectively. We used the Puzzle 1.png file and repeated the above operation 10 times for multiple DNA file recovery experiments to collect the alkaline liquid and perform subsequent sequencing analysis.
The target DNA file to be redeposited on the DNA tape was immersed in 40 μl of reaction mix (4 μl of Mbo I endonuclease, 8 μl of reaction buffer, and 28 μl of H2O) for 15 min at 37°C, followed by the addition of NaOH (0.15 M) for 5 min to complete the file removal. For subsequent DNA file deposition, the area to be redeposited on the DNA tape was immersed in 50 μl of reaction mix [including 1 μl of subsequent file (10 ng/μl), 1 μl of 2.5 mM dNTP, 1 μl of Bst 2.0 WarmStart DNA Polymerase, 5 μl of reaction buffer, 3 μl of MgSO4, and 39 μl of H2O] for 10 min at 60°C to complete the DNA file redeposition. After each step, the tape was washed five times with 2× SSPE/0.1% SDS solution and 1× TE buffer solution, respectively. We used the first 10 sequences of SUSTech Address.txt and repeated the above operation 10 times for multiple redeposition experiments. The DNA file recovery operation was performed before each redeposition to collect the alkaline liquid from the extracted file and perform subsequent sequencing analysis.
Stability test of DNA tape
The DNA files were deposited onto DNA tape by the above method and cut to 1 mm–by–5 mm size. Then, the DNA tapes were placed in 2-Melm solutions of different concentrations (10, 100, 300, 500, 700, and 1000 mM). Different concentrations of Zn(NO3)2.6H2O (1, 10, 30, 50, 70, and 100 mM) was added dropwise and reacted for 10 min to form different thicknesses of ZIFs protective layer, followed by adding DNase I (19.5 μl of H2O, 0.5 μl of 1 M MgSO4, and 1 μl of DNase I) to them for digestion for 30 min. After decapsulated, DNA files were extracted using the method described above, and the concentration and quality of DNA files were analyzed using qPCR and MGI sequencing, respectively. For the DNA control group, 0.5 μl of MgSO4 (1 M) and 1 μl of DNase I were added to 19.5 μl of DNA files (~3.2 × 107 copies/μl), and the DNA tape control group did not experience synthesizing the protective layer.
For accelerated aging experiments, DNA tape and encapsulated DNA tape were placed at 50% humidity and different temperatures (60°, 65°, and 70°C) for 3 weeks as the previous research described (
13,
17,
53). The 50% humidity environment was maintained by placing a saturated sodium bromide solution in a gas-tight container. DNA copies at different time points were obtained using the DNA file recovery method and qPCR.
Automated recovery and redeposition of the DNA files
We divided Lantern.png into four images (Puzzle 1.png, Puzzle 2.png, Puzzle 3.png, and Puzzle 4.png) and deposited them randomly into the physical data partition of DNA tape (the address assignment symbols of the four images are c7_2, c8_6, e4_4, and b6_9, respectively) using the DNA file deposition method described before. During the deposition process, SUSTech Address.txt had been intentionally deposited to the file address of Puzzle 3.png, and the removal of SUSTech Address.txt and the redeposition of the Puzzle 3.png file had been completed by the automatic operation of the DNA tape drive, which lastly completes the reading of Lantern.png after sequencing. The decapsulation solution mixture (1 ml, pH = 4.5, weak acid buffer), file recoveries solution mixture (1 ml, 0.15 M, NaOH), file recovery solution mixture (a 1-ml system, including 20 μl of 100 μM primer, 20 μl of 2.5 mM dNTP, 20 μl of Bst 2.0 WarmStart DNA Polymerase, 100 μl of reaction buffer, 60 μl of MgSO4, and 780 μl of H2O), file erasing solution mixture (A 400-μl system, 40 μl of Mbo I endonuclease, 80 μl of reaction buffer, and 280 μl of H2O), subsequent file solution mixture [a 1-ml system, 20 μl of subsequent file (10 ng/μl), 20 μl of 2.5 mM dNTP, 20 μl of Bst 2.0 WarmStart DNA Polymerase, 100 μl of reaction buffer, 60 μl of MgSO4, and 780 μl of H2O], and encapsulation solution [500 μl, 70 mM Zn(NO3)2.6H2O and 500 μl, 700 mM 2-Melm, two different liquids in syringes were connected using a three-way valve] were first loaded into the liquid supply chamber of the DNA tape dive.
We designed an interactive interface and control program in Unity software to perform file manipulation in the DNA tape drive. The control program can control the barcode recognition camera, laser locator, and motor for primary and secondary positioning. Puzzle 1.png, Puzzle 2.png, SUSTech Address.txt, and Puzzle 4.png were pre-entered into the system directory of the DNA tape dive. Select the Puzzle 1.png, Puzzle 2.png, and Puzzle 4.png in the file directory on the touch screen display and set the recovery parameters as follows: Step 1 is the flow of the decapsulation solution mixture (liquid supply flow rate and time: 40 μl/min, 15 s, discharge flow rate and time: 40 μl/min, 15 s), step 2 is the flow of the file recoveries solution mixture (liquid supply flow rate and time: 160 μl/min, 12 s; discharge flow rate and time: 0 μL/min, 0 s; duration: 288 s) through the microfluidic duct into the micro-reaction chamber, step 3 is the flow of the file recovery solution mixture (liquid supply flow rate and time: 160 μl/min, 12 s; discharge flow rate and time: 0 μl/min, 0 s; duration: 588 s; temperature: 60°C) through the microfluidic duct into the micro-reaction chamber, step 4 is the flow of the encapsulation solution (liquid supply flow rate and time: 160 μl/min, 12 s; discharge flow rate and time: 0 μl/min, 0 s; duration: 588 s) through the microfluidic duct into the micro-reaction chamber. The “erase data” were selected from e4_4 and the subsequent file Puzzle 3.png to e4_4 was deposited on the screen display to complete the DNA file redeposition operation.
The DNA redeposition parameters are set as follows: Step 1 is the flow of the decapsulation solution mixture (liquid supply flow rate and time: 40 μl/min, 15 s; discharge flow rate and time: 40 μl/min, 15 s) through the microfluidic duct into the micro-reaction chamber, step 2 is the flow of the file erasing solution mixture (liquid supply flow rate and time: 160 μl/min, 12 s, discharge flow rate and time: 0 μl/min, 0 s; duration: 888 s, temperature: 37°C) through the microfluidic duct into the micro-reaction chamber, step 3 is the flow of the subsequent file solution mixture (liquid supply flow rate and time: 160 μl/min, 12 s; discharge flow rate and time: 0 μl/min, 0 s; duration: 588 s; temperature: 60°C) through the microfluidic duct into the micro-reaction chamber, step 4 is the flow of the file recoveries solution mixture (liquid supply flow rate and time: 160 μl/min, 12 s, discharge flow rate and time: 0 μl/min, 0 s; duration: 288 s) through the microfluidic duct into the micro-reaction chamber, step 5 is the flow of the file recovery solution mixture (liquid supply flow rate and time: 160 μl/min, 12 s; discharge flow rate and time: 0 μl/min, 0 s; duration: 588 s; temperature: 60°C) through the microfluidic duct into the micro-reaction chamber, step 6 is the flow of the encapsulation solution (liquid supply flow rate and time: 160 μl/min, 12 s; discharge flow rate and time: 0 μl/min, 0 s; duration: 588 s) through the microfluidic duct into the micro-reaction chamber. Afterward, the DNA tape drive automatically completed the setup steps and collected the target file into a liquid collection chamber for subsequent sequencing and decoding.
Acknowledgments
We thank L.B. Zi for assistance with instrument construction.
Funding: This research work was supported by the National Key R&D Program of China (2018YFA0902600 to X.J.), the National Natural Science Foundation of China (22234004 to X.J. and 22427804 to X.J.), the Guangdong Provincial Key Laboratory of Advanced Biomaterials (2022B1212010003 to X.J.), the Guangdong Innovative and Entrepreneurial Research Team Program (2019ZT08Y191 to X.J.), the Shenzhen Science and Technology Program (KQTD20190929172743294 to X.J., ZDSYS20200811144003009 to X.J., JCYJ20220818101407017 to X.J., SGDX20230116091642001 to X.J., GJHZ20220913142610019 to D.W.), the Guangdong Major Talent Introduction Project (2019CX01Y196 to X.J.), and the Tencent Foundation through the XPLORER PRIZE (to X.J.).
Author contributions: Conceptualization: J.L., X.J., and R.W. Methodology: J.L., C.M., D.W., and R.W. Visualization: J.L. and R.W. Software: J.L., S.W., S.Z., J.S., and R.W. Investigation: J.L., S.W., X.Lu., X. Li, and R.W. Writing—original draft: J.L. and R.W. Writing—review and editing: J.L., J.S., X.J., and C.F.
Competing interests: X.J. and J.L. are inventors on a patent application (patent publication no. CN114360600A and CN114388067A) submitted by Guangzhou Jiaquan Patent and Trademark Office Co. Ltd. on 31 December 2021. Both patents are in patent examination status. All other authors declare that they have no competing interests.
Data and materials availability: The data and code that support the findings of this study are available within the article and its Supplementary Materials files. Analysis software and encoder/decoder are available at
http://doi.org/10.5281/zenodo.15788037.