superbed

change column header to reflect relative location

Jan 21, 2012

aeb846d · Jan 21, 2012

Name	Name	Last commit message	Last commit date
parent directory ..
README.rst	README.rst	add superbed annotatoin stuff.	May 24, 2011
superanno.py	superanno.py	change column header to reflect relative location	Jan 21, 2012
superbed.py	superbed.py	show command for refGene superbed and let geneSymbol be optional	Aug 1, 2011

README.rst

Annotate a Bed File

Given a file in some kind of bed format (at least the first 3 cols are chr start end), generate a new file with 2 extra columns: gene, distance. In cases where the distance is zero, the feature type(s) where the overlap occured is reported. These could be introns/exons/utrs, etc.

Example Workflow

Get the data from UCSC (or your local mirror).

ORG=hg19
mysql -D $ORG -e "select chrom,txStart,txEnd,cdsStart,cdsEnd,K.name,X.geneSymbol,proteinID,strand,exonStarts,exonEnds from knownGene as K,kgXref as X where  X.kgId=K.name" > $ORG.notbed

Check the actual command in UCSC if you do not have a local DB set up.

create a bed6 file with a line for each column:

python superbed.py $ORG.notbed > $ORG.super.bed

install `bedtools`_ and `pybedtools`_

annotate some data with superanno.py:

python superanno.py -a my.bed -b $ORG.super.bed --header > my.annotated.bed

my.annotated.bed will now have 2 extra columns: gene(s), distance/feature_type.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Files

superbed

superbed

README.rst

Annotate a Bed File

Example Workflow

Files

superbed

Directory actions

More options

Directory actions

More options

Latest commit

History

superbed

Folders and files

parent directory

README.rst

Annotate a Bed File

Example Workflow