TODO¶
IMPORTANT¶
For large-contigs test, the evenness of the coverage should be assessed.
Perform large contig, many contig, and ultra deep tests.
make releasewould fail on platforms without pkg-config, especially on Haiku OS and Debian GNU/Hurd.Add support for PE FASTA/FASTQ.
Performance¶
I/O:
The home-made “asynchronous IO” spent too much time in deallocating and creating new
std::unique_ptrs. This problem is more obvious on smaller objects, like FASTA when being compared to FASTQ.Consider using the method implemented in
pigz. That is, create a ring buffer that stores raw pointers to record datagrams that allows reusing.The current implementation passes too many small objects across the concurrent queue and I/O handlers, which is inefficient. This problem will be considerably worsen if POSIX AIO is used.
I/O Formats¶
Support circular genome or RNA?
Support simulating BGI/MGISEQ reads and new Illumina models: Started.
Add
--i-nreadsto accurately specify the number of reads to simulate?
Simulate Allele-Specific Expression¶
The simulator will accept a genome, a variation VCF file, and a gene annotation GTF file.