BioNumPy

13 posts

BioNumPy banner
BioNumPy

BioNumPy

@BioNumPy

Python library for array programming on biological datasets. Documentation available at: https://t.co/R622AQLPzs

Katılım Ocak 2023
27 Takip Edilen56 Takipçiler
BioNumPy
BioNumPy@BioNumPy·
.. or around transcription factor peak summits. Here we plot the pileup for reads on the positive and negative strand, and clearly see the pileups we expect around the summit.
BioNumPy tweet mediaBioNumPy tweet media
English
1
1
2
268
BioNumPy
BioNumPy@BioNumPy·
BioNumPy has been updated with changes that make it a lot easier to work with genomic intervals and data on a reference genome 😀 Here are a few cool examples to illustrate the new stuff:
English
1
5
12
1.1K
BioNumPy
BioNumPy@BioNumPy·
@vsbuffalo BioNumPy has some support for gtf files, but have not yet defined a complete set of abstractions for them. Would love suggestions for useful methods!
BioNumPy tweet media
English
0
0
0
22
Vince Buffalo
Vince Buffalo@vsbuffalo·
I'm curious now about how often we re-write bioinformatics code. You have to parse a GTF/GFF file in Python. What do you do?
English
17
3
10
8.1K
BioNumPy
BioNumPy@BioNumPy·
@vsbuffalo BioNumPy tries to give a uniform interface for parsing common bioinformatics formats. Keeping the header information from input through analysis to output is under progress and will be in place for BioNumPy 0.3 github.com/bionumpy/bionu…
English
0
0
0
6
Vince Buffalo
Vince Buffalo@vsbuffalo·
I think having unified comment metadata headers would go a long way. So often I lose column headers and metadata rows (i.e. starting with '#') when using the Unix data tools I love. We need a set of tools, even if simple wrappers, that preserve this important data.
English
4
3
22
3.9K
Vince Buffalo
Vince Buffalo@vsbuffalo·
So much of bioinformatics is endless writing of parsers for file formats written by software, and it just shouldn't be this way. This is needlessly time-intensive, bug-prone, and prevents higher-level abstractions from being developed. We need tidy bioinformatics.
English
9
32
292
61.2K
BioNumPy
BioNumPy@BioNumPy·
Day 2/5 of small BioNumPy examples: Motif matching. We download a motif from Jaspar, compute max motif score per read in a FASTQ file and plot a histogram of the scores. Run the code here: colab.research.google.com/github/bionump…
BioNumPy tweet mediaBioNumPy tweet media
English
0
7
6
621
BioNumPy
BioNumPy@BioNumPy·
Every day this week, we'll share a small example of how BioNumPy can be used. First out: FASTQ filtering (try out the code yourself here: colab.research.google.com/github/bionump…) .. and remember to follow us for daily updates ☺️
BioNumPy tweet media
English
0
8
10
627