Sequence Determination from Overlapping Fragments : A Simple Model of Whole-Genome Shotgun Sequencing

Abstract : Assembling fragments randomly sampled from along a sequence is the basis of whole-genome shotgun sequencing, a technique used to map the DNA of the human and other genomes. We calculate the probability that a random sequence can be recovered from a collection of overlapping fragments. We provide an exact solution for an infinite alphabet and in the case of constant overlaps. For the general problem we apply two assembly strategies and give the probability that the assembly puzzle can be solved in the limit of infinitely many fragments.
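
Illustration (not part of the original record): the recovery question posed in the abstract can be explored numerically with a minimal Monte Carlo sketch. It is not the authors' calculation. It assumes fixed-length fragments with uniformly random start positions, and an alphabet large enough that any shared letter identifies a true overlap, so that recovery reduces to a coverage condition in which neighbouring fragments share at least `min_overlap` letters. The function names and parameters are hypothetical.

```python
import random


def covers_with_overlap(starts, frag_len, seq_len, min_overlap=1):
    """Return True if equal-length fragments starting at `starts` cover
    positions 0..seq_len-1 and every pair of neighbouring fragments
    shares at least `min_overlap` letters (illustrative criterion)."""
    starts = sorted(set(starts))
    if not starts or starts[0] != 0:
        return False  # the first letter of the sequence is not sampled
    reach = frag_len  # index just past the last covered position
    for s in starts[1:]:
        if s > reach - min_overlap:
            return False  # gap, or overlap shorter than required
        reach = s + frag_len
    return reach >= seq_len


def estimate_recovery_probability(seq_len, frag_len, n_frags,
                                  trials=2000, seed=0):
    """Estimate, by simulation, the fraction of random fragment sets
    that satisfy the coverage criterion above."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        # Uniform start positions such that each fragment fits in the sequence.
        starts = [rng.randint(0, seq_len - frag_len) for _ in range(n_frags)]
        hits += covers_with_overlap(starts, frag_len, seq_len)
    return hits / trials


if __name__ == "__main__":
    for n in (20, 50, 100, 200):
        print(n, estimate_recovery_probability(100, 20, n))
```

For a finite alphabet, spurious matches between unrelated fragments would also have to be accounted for, which this sketch deliberately ignores.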
Document type : Journal article
https://hal.archives-ouvertes.fr/hal-03285626
Contributor : Bernard Derrida
Submitted on : Wednesday, July 21, 2021 - 9:16:20 AM
Last modification on : Wednesday, July 28, 2021 - 12:04:29 PM

Citation

Bernard Derrida, Thomas M A Fink. Sequence Determination from Overlapping Fragments : A Simple Model of Whole-Genome Shotgun Sequencing. Physical Review Letters, American Physical Society, 2002, 88 (6), pp.068106. ⟨10.1103/PhysRevLett.88.068106⟩. ⟨hal-03285626⟩