Robert Vaser

58 posts

Robert Vaser

Robert Vaser

@robertvaser

Zagreb, Croatia Katılım Kasım 2015
16 Takip Edilen120 Takipçiler
Robert Vaser retweetledi
Mile Sikic
Mile Sikic@msikic·
I am hiring a postdoc at the Genome Institute of Singapore. Join @NiranjanTW and me on a project aiming to rapidly identify microbes in a sample using sequencing data and deep learning methods inspired by song recognition. DM or email me
English
1
27
44
0
Robert Vaser retweetledi
A*STAR Genome Institute of Singapore (A*STAR GIS)
Scientists from GIS, namely Prof Mile Sikic and Postdoctoral Fellow Dr. Robert Vaser have published their work on a novel de novo genome assembler called 'Raven in Nature Computational Science'. Check it out! Read: [bit.ly/3faKGOW]
A*STAR Genome Institute of Singapore (A*STAR GIS) tweet media
English
0
9
26
0
Robert Vaser retweetledi
Mile Sikic
Mile Sikic@msikic·
De novo assembler Raven is published at NatCompSci rdcu.be/ckZT1. Easy to use, fast and memory efficient. Try it with metagenomic data too. w/@robertvaser !!!
English
3
55
137
0
Robert Vaser
Robert Vaser@robertvaser·
We decreased memory requirements to ~40% of the input FASTA size plus a constant not exceeding 16GB in the latest Raven version (github.com/lbcb-sci/raven…), without any significant impact on execution time. 1/2
English
2
13
27
0
Robert Vaser
Robert Vaser@robertvaser·
@cyxiang1027 @msikic RC:i:<int> is the number of reads concatenated into the contig. XO:i:<bool> denotes if the contig is circular (1 - is, 0 - is not). Circular contigs can also be seen in GFA.
English
1
0
0
0
2001
2001@NYFCBW·
@msikic @robertvaser SAM-like tag in sequence headers from Raven,what does "RC:i:8 XO:i:0" mean?How can i tell if the sequence is circular?
English
1
0
0
0
Robert Vaser retweetledi
Mile Sikic
Mile Sikic@msikic·
Raven version 1.5.0 - up to 2x faster (increases with data accuracy), improves NGA50 in most cases, properly handles small plasmids. w/ @robertvaser github.com/lbcb-sci/raven
English
1
22
48
0
Robert Vaser
Robert Vaser@robertvaser·
@lh3lh3 @IvanSovic @XLR I think I know where the bug is. As Ivan already mentioned, Racon retains the longest overlap per read, but the last longest overlap is retained, instead of the first. Thanks for testing/answers!
English
0
0
2
0
Heng Li
Heng Li@lh3lh3·
A technical question (could save me a few hours on experiment): does Racon use all alignments in SAM for consensus or does it ignore secondary alignments?
English
2
6
22
0
Heng Li
Heng Li@lh3lh3·
@robertvaser @IvanSovic @XLR This is T2T CHM13. 3.05Gb in size. Alignments of the same query are always clustered together. Note that many reads are aligned in full length to multiple locations.
English
1
0
0
0
Robert Vaser
Robert Vaser@robertvaser·
@lh3lh3 @IvanSovic @XLR Minimap2 transforms 4Gbp into minimizer index? Alignments per read are outputted in one block, i.e. the SAM is already sorted by read name/id?
English
1
0
0
0
Heng Li
Heng Li@lh3lh3·
@robertvaser @IvanSovic @XLR No options. Overlaps from immediate minimap2 SAM output. I believe racon is still using all the secondary alignments.
English
1
0
0
0
Heng Li
Heng Li@lh3lh3·
@IvanSovic @XLR Did the experiment anyway. For T2T HiFi, --secondary=no reduces substitution errors from 379.3k to 3.4k, a 100-fold reduction.
English
2
2
5
0
Robert Vaser
Robert Vaser@robertvaser·
Similar requirements hold for Racon as well, although this improvement is on branch library (github.com/lbcb-sci/racon…) which currently does not support overlaps/alignments from the outside. Any feedback is welcome! 2/2
English
0
0
7
0
Robert Vaser
Robert Vaser@robertvaser·
@andreaswallberg @msikic That depends on the sequencing depth. The memory consumptions should be qual to ~1.3 times the FASTQ file.
English
0
0
3
0
Andreas Wallberg
Andreas Wallberg@andreaswallberg·
@msikic @robertvaser It looks great! I am trying it on a complex 18Gbp genome using Nanopore data, that was previously assembled with wtdbg2. Will probably not be able to fit it into the 1 TB RAM machine I have access to atm but hope to be able to run it on a 4.7 TB RAM later. Tips are welcome!
English
1
0
2
0