@@ -39,13 +39,31 @@ combinations of the annotation features *gene\_name*, *gene\_id*,
39
39
* transcript\_ name* , * transcript\_ id* , * exon\_ id* as well as the location of
40
40
these genomic elements (contig, start position, end position, strand).
41
41
42
- ## Gene Names
42
+ ## Genes
43
43
44
- ` gene_names() `
45
- : returns all gene names in the annotation database
44
+ ` genes(contig=None, strand=None) `
45
+ : returns list of Gene objects, optionally restricted to a particular contig
46
+ or strand.
46
47
47
- ` gene_names_on_contig(contig) `
48
- : all gene names on a particular chromosome/contig
48
+ ` genes_at_locus(contig, position, end=None, strand=None) `
49
+ : returns list of Gene objects overlapping a particular position on a contig,
50
+ optionally extend into a range with the ` end ` parameter and restrict to
51
+ forward or backward strand by passing ` strand='+' ` or ` strand='-' ` .
52
+
53
+ ` gene_by_id(gene_id) `
54
+ : return Gene object for given Ensembl gene ID (e.g. "ENSG00000068793")
55
+
56
+ ` gene_names(contig=None, strand=None) `
57
+ : returns all gene names in the annotation database, optionally restricted
58
+ to a particular contig or strand.
59
+
60
+ ` genes_by_name(gene_name) `
61
+ : get all the unqiue genes with the given name (there might be multiple
62
+ due to copies in the genome), return a list containing a Gene object for each
63
+ distinct ID.
64
+
65
+ ` gene_by_protein_id(protein_id) `
66
+ : find Gene associated with the given Ensembl protein ID (e.g. "ENSP00000350283")
49
67
50
68
` gene_names_at_locus(contig, position, end=None, strand=None) `
51
69
: names of genes overlapping with the given locus
@@ -63,23 +81,28 @@ these genomic elements (contig, start position, end position, strand).
63
81
` gene_name_of_exon_id(exon_id) `
64
82
: name of gene associated with given exon ID
65
83
66
-
67
- ## Gene IDs
68
-
69
84
` gene_ids(contig=None, strand=None) `
70
85
: all gene IDs in the annotation database
71
86
72
- ` gene_id_of_gene_name (gene_name)`
73
- : translate Ensembl gene ID to its corresponding name
87
+ ` gene_ids_of_gene_name (gene_name)`
88
+ : all Ensembl gene IDs with the given name
74
89
75
90
76
- ## Transcript Names
91
+ ## Transcripts
92
+
93
+ ` transcripts(contig=None, strand=None) `
94
+ : returns list of Transcript objects for all transcript entries in the
95
+ Ensembl database, optionally restricted to a particular contig or strand.
96
+
97
+ ` transcript_by_id(transcript_id) `
98
+ : construct Transcript object for given Ensembl transcript ID (e.g. "ENST00000369985")
99
+
100
+ ` transcripts_by_name(transcript_name) `
101
+ : returns list of Transcript objects for every transcript matching the given name.
77
102
78
103
` transcript_names(contig=None, strand=None) `
79
104
: all transcript names in the annotation database
80
105
81
- ## Transcript IDs
82
-
83
106
` transcript_ids(contig=None, strand=None) `
84
107
: returns all transcript IDs in the annotation database
85
108
@@ -89,14 +112,14 @@ these genomic elements (contig, start position, end position, strand).
89
112
` transcript_ids_of_gene_name(gene_name) `
90
113
: return IDs of all transcripts associated with given gene name
91
114
92
- ` transcript_id_of_transcript_name (transcript_name)`
93
- : translate transcript name to its ID
115
+ ` transcript_ids_of_transcript_name (transcript_name)`
116
+ : find all Ensembl transcript IDs with the given name
94
117
95
118
` transcript_ids_of_exon_id(exon_id) `
96
119
: return IDs of all transcripts associatd with given exon ID
97
120
98
121
99
- ## Exon IDs
122
+ ## Exons
100
123
101
124
` exon_ids(contig=None, strand=None) `
102
125
: returns all transcript IDs in the annotation database
@@ -109,32 +132,3 @@ these genomic elements (contig, start position, end position, strand).
109
132
110
133
` exon_ids_of_transcript_id(transcript_id) `
111
134
112
-
113
- ## Locations
114
-
115
- These functions currently assume that each gene maps to a single unique
116
- location, which is invalid both with heavily copied genes
117
- (e.g. [ U1] ( http://en.wikipedia.org/wiki/U1_spliceosomal_RNA ) ) and with
118
- polymorphic regions (e.g. HLA genes).
119
-
120
- ` location_of_gene_name(gene_name) `
121
-
122
- ` location_of_gene_id(gene_id) `
123
-
124
- ` location_of_transcript_id(transcript_id) `
125
-
126
- ` location_of_exon_id(exon_id) `
127
-
128
-
129
- ## Start Codons
130
-
131
- ` start_codon_of_transcript_id(transcript_id) `
132
-
133
- ` start_codon_of_transcript_name(transcript_id) `
134
-
135
-
136
- ## Stop Codons
137
-
138
- ` stop_codon_of_transcript_id(transcript_id) `
139
-
140
- ` stop_codon_of_transcript_name(transcript_name) `
0 commit comments