This help document answers the following types of questions about gene traps.
General questions about gene traps
Top
What is gene trapping?
Gene trapping is a high-throughput approach used to introduce genome-wide insertional mutations.
Gene trap experiments in embryonic stem (ES) cells produce mutant cell lines in which integration into a putative gene is selected by expression of a selection gene in ES cells. The trapped gene is usually (though not necessarily) mutated by the integration, altering either the gene expression or the protein product produced. The site of integration can be characterized by cloning, extension of cDNA products, or direct sequencing of the insertion site.
See also:
Top
How are gene trap alleles named?
Once a gene trap cell line is isolated and the insertion site identified, a gene trap mutant cell line designation is used to create an allele symbol.
- If the trapped gene is known, the allele designation has the following format:
GenesymbolGt(mutant cell line name or vector name or vector content)Labcode
For example:
Ntn1Gt(ST629)Byg | The gene trap mutant cell line ST629 generated by BayGenomics contains an insertional mutation known to disrupt the netrin 1 (Ntn1) gene. |
- If the cell line designation is unavailable, the designation includes a serial number in addition to Gt, cell line/vector name/vector content, and laboratory ILAR code.
- When there is no marker association for a gene trap cell line, an allele (without a marker) includes the Gt(mutant cell line name or vector name or vector content)Labcode designation. For example:
Gt(CSH240)Byg | A gene trap at an undefined locus in mutant ES cell line CSH240 made by BayGenomics. |
Top
Who are the cell line creators?
- The MGI database contains all dbGSS mouse gene traps from the following cell line creators: BayGenomics, CMHD, EGTC, ESDB, FHCRC, GGTC, SIGTR, TIGEM, and TIGM, as well as from Lexicon Genetics.
- The database also contains targeted mutant cell lines from organizations such as Velocigene.
- The following table lists the major creators/organizations, their abbreviations and associated ILAR lab codes, and provides an example of cell line ID, sequence tag ID, dbGSS (GenBank) sequence ID, and allele or mutant cell line nomenclature for a gene trap from each creator.
- There are no sequence tag or GenBenk sequence identifiers for the Velocigene targeted KOMP alleles.
Creator |
Abbreviation |
Lab Code / Name (ILAR) 1 |
Cell Line ID (Creator) |
Sequence Tag ID (Creator) |
Seq ID (GenBank) |
Nomenclature Example |
BayGenomics |
BG |
Byg / BayGenomics |
XH152 |
XH152 |
CC199534 |
Alms1Gt(XH152)Byg |
Centre for Modeling Human Disease |
CMHD |
Cmhd / Centre for Modeling Human Disease | CMHD-GT_141.1H7-3 | CMHD-GT_141.1H7-3 | CL706334 | Etv2Gt(CMHD-GT_141.1H7-3)Cmhd |
Exchangeable Gene Trap Clones |
EGTC |
Imeg / Institute of Molecular Embryology and Genetics | Ayu21-B196 | Ayu21-B196 | AB299416 | Adipor2Gt(Ayu21-B196)Imeg |
Embryonic Stem Cell Database |
ESDB |
Mfgc / Mammalian Functional Genomics Centre |
PST11001 |
PST11001 | CZ192167 | AtrxGt(PST11001)Mfgc |
Fred Hutchinson Cancer Research Center |
FHCRC |
Sor / Philippe Soriano | FHCRC-GT-S20-7E1 | FHCRC-GT-S20-7E1 | CL706141 | Chd7Gt(FHCRC-GT-S20-7E1)Sor |
German Gene Trap Consortium |
GGTC |
Wrst / German Gene Trap Consortium | W223E05 | W223E05 | CL215544 | Gnl3Gt(W223E05)Wrst |
Lexicon Genetics |
N/A |
Lex / Lexicon Genetics |
OST179231 | OST179231 | CG559761 | Pomgnt1Gt(OST179231)Lex |
Sanger Institute Gene Trap Resource |
SIGTR |
Wtsi / Wellcome Trust Sanger Institute | BC0122 | BC0122 | CZ259233 | Atg16l1Gt(BC0122)Wtsi) |
Telethon Institute of Genetics and Medicine |
TIGEM |
Mist / Telethon Institute of Genetics and Medicine |
GC0844 |
GC0844 |
CW020574 |
Gt(GC0844)Mist |
Texas A&M Institute for Genomic Medicine |
TIGM |
Tigm / Texas A&M Institute for Genomic Medicine |
IST10006D1 |
IST10006D1BBF1 |
EF717953 |
Slc25a40Gt(IST10006D1)Tigm |
Velocigene2 |
N/A |
Vlcg / Velocigene |
11648B-F7 |
N/A |
N/A |
Abcb9tm1(KOMP)Vlcg |
Notes:
- Institute for Laboratory Animal Research (ILAR)
- This organization provides targeted mutant cell lines.
Top
Is there a way to tell which group created a mutant ES cell line?
If MGI has evidence from a publication or from direct submission, you can determine the ES cell line creator. On a Gene-trapped Allele Detail report, the creator information appears in the Nomenclature section. The lab code for the creator terminates the Symbol and is displayed at the end of the allele Name. See Who are the cell line creators? for additional information about creator abbreviations, lab codes, and cell line IDs.
Top
How can I determine if an ES cell line has been made into a mouse?
- Locate the Mutation Origin section of a Gene-trapped Allele Detail report.
- If MGI has a paper containing evidence that the cell line has been made into a mouse, Germline Transmission: and Earliest citation of germline transmission plus a citation appears. Click the J: number to access MGI Reference Query Results for complete details on the paper.
Note: The Find Mice (IMSR) section of the Gene-trapped Allele Detail report contains information on this topic. See How can I tell if a cell line or a mouse strain carrying the mutation is available for purchase? for an example.
Top
How can I tell if a cell line or a mouse strain carrying the mutation is available for purchase?
Locate the Find Mice (IMSR) section of a Gene-trapped Allele Detail report. The information indicates whether or not mouse strains and cell lines are available from the International Mouse Strain Resource (IMSR).
Top
How can I identify a gene trap mutant ES cell line ID?
- Locate the Mutation Origin section of any Gene-trapped Allele Detail report.
- The first line in the table row identifies the mutant cell line ID. In the example below, the mutant cell line ID is AZ0718.
- The Parent Cell Line, Strain of Origin, and Project Collection may also appear in the Mutation Origin section.
- Other rows in the Gene-trapped Allele Detail report may identify sequence tag IDs for this cell line and indicate International Mouse Strain Repository (IMSR) availability.
See the table at Who are the cell line creators? for examples of additional accession identifiers associated with cell lines.
Top
Is there a list of all gene trap cell line accession IDs?
Yes. Click Download-->MGI Data and Statistical Reports--> to access MGI Marker associations to Gene Trap IDs (tab-delimited).
Top
Is there a list of all IDs and nomenclature associated with gene traps in MGI?
Yes. Click Download-->All MGI Reports--> to see all gene traps associated with markers.
Top
Searching for gene trap or cell line data
Top
Where can I find gene trap and/or cell line data?
MGI incorporates mouse cell line and gene trap sequences for almost all gene traps in dbGSS including IGTC and Lexicon Genetics gene traps, and TIGM. The International Gene Trap Consortium (IGTC) houses publicly available gene trap cell lines available upon request.
Top
How can I find all the available sequence-tagged gene traps from a specific creator?
Use the MGI Phenotypes, Alleles, & Disease Models Query Form.
- Enter the provider's ILAR-designated lab code, preceded by the wild card, an asterisk (*), in the Gene/Marker, or Allele field (in the Nomenclature & Genome Location section). Examples: *Imeg (Institute of Molecular Embryology and Genetics - EGTC) or *Wtsi (Wellcome Trust Sanger Institute). Leave the default (contains).
- Click gene trapped (in the Categories section).
- Click Search. The result is aSummary of all gene traps containing your character string entry. Note that you can also find affected anatomical systems and genetic backgrounds of interest in this report.
Alternately, instead of searching for the project lab code in the Nomenclature section, if your creator is listed in the Project Collections portion of the Categories section of the form, you can just check the project. In this section, you can make multiple selections.
Top
Can I search for all gene traps or only for those that have been made into mice?
Both searches are possible. You can customize your search to include all gene traps or only those made into mice.
To include all gene traps:
To include only gene traps made into mice:
- On the MGI Phenotypes, Alleles & Disease Models Query Form:
- In the top center of the Categories section, check Exclude alleles existing only as cell lines.
- In the Generation Method portion of the Categories section, check gene trapped.
- Click Search.
Top
What are the ways of searching for gene trap insertions?
- You can use the Phenotypes, Alleles, & Disease Models Query Form to search for gene trap insertions:
- By phenotype
- In the top row, next to Mouse phenotypes & mouse models of human disease, enter phenotype terms in the Phenotype/Disease box. (The Select or Browse links provide lists or structures or ontologies to choose from.)
- In the Generation Method portion of the Categories section, check gene trapped.
- Make any other selections as desired (e.g., you may want to limit your search to a chromosomal region or Allele Attributes).
- Click Search to begin and/or Reset to clear the form.
- By nomenclature
- Enter the symbol, name, or synonym of a known mouse gene in the Gene/Marker, or Allele field to find any associated gene traps.
- Check gene trapped (in the Categories section).
- Click Search.
- By chromosomal location
- In the Nomenclature & genome location section, click the chromosome of the associated marker and enter additional information as desired in the Cytogenetic Band, cM Position, or Genome Coordinates boxes.
- In the Categories section, check gene trapped.
- Click Search.
Note: The MGI gene trap algorithm restricts the search to gene traps with marker associations; a chromosomal location search therefore cannot return gene traps without those associations. Thus, when there is no marker association for the gene trap, a search by location from the Phenotypes, Alleles, & Disease Models Query Form does not yield a result. However, if you know a landmark or a region, try entering this location in MGI's JBrowse and clicking the GeneTraps_RNA_tags and GeneTraps_DNA_tags tracks. (See How can I tell if a given gene trap has trapped a gene? and How do I interpret JBrowse gene trap tracks? for additional information about JBrowse.)
- By category
- In the Categories section, click gene trapped.
- By default, this query queries returns all gene traps (cell line and mouse line). If you want to limit this search to cell lines made into mice, click Exclude alleles existing only as cell lines.
- Click Search.
- You can use the Quick Search box (at the top of every MGI page) and enter:
- the nomenclature (symbol, name, synonym) for a gene or an allele or a creator/provider abbreviation (see Who are the cell line creators? for a list).
- an accession identifier (mutant cell line ID, provider ID, GenBank sequence tag ID). Summary reports return links to any associated MGI detail pages.
- only the cell line designation to find gene trap alleles not located in a known gene.
Top
Where can I find vector information?
The Gene-trapped Allele Detail report provides the name, type, and description of the vector used for a gene trap event if available.
Locate Mutation description--> Mutation. If there is vector information available, it appears on the Mutation line.
Top
Can I use the MGI Batch Query to find gene traps?
No. The ability to enter a group of cell line accession IDs and return information about corresponding gene traps may be available in a future MGI release.
Top
I search using a gene trap ID (or nomenclature) but get no marker results. Why not?
There may be several reasons for not getting expected results:
- The gene trap is not in GenBank.
- The gene trap is in GenBank, but it is not from one of the established creators (see Who are the cell line creators? for a list).
- There is no gene-trap-to-gene association for the gene trap.
If you are using the Quick Search tool, a link to the allele detail may appear in the Other Results by ID section of the Quick Search Results. You can also try entering a landmark or a region in JBrowse and clicking the GeneTraps_RNA_tags and GeneTraps_DNA_tags tracks. (See How can I tell if a given gene trap has trapped a gene? and How do I interpret JBrowse gene trap tracks? for additional information.)
See Why aren't all gene traps associated with a gene? for more details.
Top
Finding gene associations
Top
How can I find all the gene traps associated with a given gene?
- You can use the MGI Quick Search tool:
- Enter the symbol, synonym, or name of a gene in the Quick Search box (on the top of any MGI page). A Quick Search Results page appears. If the gene is in the MGI database, it is listed under Genome Features.
- Click the desired gene symbol. A Gene Detail page appears.
- Locate the Mutations, Alleles and Phenotypes section. If there are associated gene traps, Gene trapped followed by the number of gene trapped alleles is shown.
- Click the number to see a Phenotypic Allele Query Results Summary that lists all the gene traps currently associated with this gene. In the example below, that number is 309.
- You can use the MGI Phenotypes, Alleles, & Disease Models Query Form. Enter the gene nomenclature and follow the appropriate links.
- You can look through the MGI Marker associations to Gene Trap IDs (tab-delimited). report to locate the gene and its traps.
Top
How can I find all genes associated with gene trap insertions?
Click Download-->MGI Data and Statistical Reports--> to access MGI Marker associations to Gene Trap IDs (tab-delimited) for this list.
Or:
- On the Phenotypes, Alleles, & Disease Models Query Form, locate the Categories section.
- Click the box beside gene trapped.
- Click Search. A Phenotypic Alleles Summary report appears, identifying the number and name of all alleles associated with gene trap insertions.
Top
Why aren't all gene traps associated with a gene?
MGI does not establish gene trap-to-gene associations:
- where sequence tags for a gene trap allele overlap multiple genes in the trapped gene orientation:
->
----->
------>
---------------
- where sequence tags for a gene trap allele overlap one or more genes in reverse orientation (not disrupted by the gene trap mechanism):
<-
----->
--------------
- where sequence tags for a gene trap allele overlap no known genes using current gene model coordinates:
->
--------------
- where gene trap sequence tags for an allele do not align with the reference mouse genome:
---------------
- where gene trap sequence tags for an allele align equally well to multiple locations:
ChrA 1->
--------------
ChrB 1->
---------------
- for mixed cell lines, characterized by multiple sequence tags for the same cell line that have good alignments to different locations in the genome:
ChrA 1->(MCL:xyz1)
-------------------
ChrB 2-> (MCL:xyz1)
-------------------
Legend:
-> Gene Trap Sequence Tag aligned to the genome. The arrow indicates gene trap polarity.
Numbers Indicate tag identities.
MCL Indicates Mutant Cell Line identity.
-----> Known gene aligned to the genome. Arrow indicates transcriptional polarity.
----------- A location in the reference mouse genome. For multiple locations, different chromosomes are indicated.
Top
How can I find genes near a gene trap not associated with a gene?
If the Gene-trapped Allele Detail report displays a genome browser link, you can view gene trap sequence tag positions relative to genome coordinates.
- In the Mutation description section, click the icon
() next to Genome Context.
- Click View all gene trap sequence tags within a 10kb region (bottom of the thumbnail). The MGI Genome Browser application (JBrowse) appears.
- Use the Scroll/Zoom feature in JBrowse to modify the coordinate range.
Top
How can I find gene traps not associated with a gene but that are near a gene of interest?
- Use the Genes and Markers Query Form to locate the gene of interest. The Genes and Markers Query Results -- Summary appears.
- Click the gene symbol on the summary report. The Gene Detail report appears.
- In the Location & Maps section (on the Gene Detail report), click the more icon. If this gene has been mapped to the latest genome assembly, it will display a thumbnail image and a link for the Mouse Genome Browser. You can follow this link to view the gene relative to genome coordinates.
- Click Mouse Genome Browser.
- In the left hand column, confirm that the gene trap allele tracks are selected (GeneTraps_DNA_tags, GeneTraps_RNA_tags, GeneTraps_per_Gene).
- Use the JBrowse magnifying glass icons to zoom in or out. All gene traps, regardless of gene association, will appear.
Top
Can I search for gene traps where no known genes exist?
No. However, MGI's JBrowse provides views of gene trap sequence tag positions relative to genome coordinates. See Can I view the approximate locations of gene trap insertions? for how to access this genome browser. See How can I tell if a given gene trap has trapped a gene? and How do I interpret JBrowse gene trap tracks? for additional JBrowse information.
Note: You can find any gene trap, regardless of gene association, by using its mutant cell line ID, provider tag ID, or GenBank sequence tag ID.
Top
Can I use a marker detail report to determine how many gene traps are available as ES cell lines only, as both cell lines and mice, or as mice only?
Not currently. This may be available in the future in a public report.
Top
Searching for sequence information
Top
Where can I find sequence tag identifier(s)?
- Locate the Mutation description section on a Gene-trapped Allele Detail report.
- Click the () icon next to Sequence Tags.
In the table that appears, the GenBank ID column lists the GenBank sequence tag ID for each gene trap insertion.
The Tag ID column contains the sequence tag identifiers.
The sequence tag's GenBank identifier (GenBank ID), the sequencing technique (Method), and chromosome, map position, and strand (Tag Location) also appear. Note that you can select sequences here to download in FASTA format or forward to NCBI's BLAST program.
Top
Is there a way to determine sequence tag orientation?
Yes.
- Information about gene orientation with respect to a given sequence tag appears in the Gene-trapped Allele Detail report.
- Locate the Mutation description row on a Gene-trapped Allele Detail report.
- Click the icon () next to Sequence Tags.
- Find the Tag Location (strand) column. Strand information appears in parentheses, after the chromosome number and genome coordinates. Plus (+) indicates that the sequence tag has a left to right orientation; minus (-) indicates right to left.
- Expand the Genome Context section and then click View all gene trap sequence tags … to see a full-page genome browser view of the gene trap insertion. Use the magnifying glass icons in JBrowse to zoom in or out.
Top
Is is possible to tell whether a sequence tag defines the insertion site (is a genomic- or DNA-based tag) or not (is an RNA-based tag)?
Yes. On a Gene-trapped Allele Detail report, click the () icon next to Sequence Tags. The table that appears contains a Method column where the information you are looking for appears (if known).
If the method is 3' RACE or 5' RACE, the tag defining the insertion site is RNA-based. Any other method listed is DNA- (or genomic-) based (for example, inverse PCR, 3' or 5' splinkerette, plasmid rescue).
Top
Where can I find the coordinates and strand for a given gene trap sequence tag?
The Gene-trapped Allele Detail report displays this information for any gene traps that map to a single position in the mouse genome.
- In the Mutation description section, click the () icon next to Sequence Tags.
- The table that appears contains a Tag Location (strand) column where the coordinates and strand for each tag appear.
- Plus (+) indicates that the sequence tag has a left to right orientation; minus (-) indicates right to left.
- Unknown or Multiple may also appear in this column.
See the previous image for an example of the information in the Tag Location (strand) column.
Top
Can I view coordinates/strands for all significant sequence tag BLAST hits for mixed cell line gene traps that map to multiple locations?
Yes. When the Mutation description section (next to Mutation) of a Gene-trapped Allele Detail report indicates that sequence tags for an insertion align in multiple locations:
- Click the () icon next to Sequence Tags.
- In the table that appears, click the sequence tag listed beneath Select (in the last column).
- Click the down icon next to Selected Tags: and select NCBI BLAST.
- Click Go. The NCBI BLAST query form appears with the relevant fields filled in. Adjust as desired (e.g., you may prefer different Output options.)
- Click the BLAST button.
Top
Can I download the sequence tag for a gene trap insertion in FASTA? Can I send it to NCBI BLAST?
Yes. This option is available on the following MGI reports:
- Gene-trapped Allele Detail (Expand the Sequence Tag section to see this option.)
- Sequence Detail.
Top
Can I find gene trap insertions by BLASTing against gene trap sequences and searching by sequence similarity?
Yes. Use NCBI's BLAST. See NCBI BLAST Help for how to use this tool.
Top
Can I view the approximate locations of gene trap insertions?
Yes. Both the Gene-trapped Allele Detail report and the MGI genome browser (JBrowse) provide views of gene trap sequence tag positions relative to representative gene transcripts.
In the Mutation description section, click the icon (click the icon () next to Genome Context. To access the JBrowse view, click View all gene trap sequence tags in this region at the bottom of the image that appears.
See How can I tell if a given gene trap has trapped a gene? and How do I interpret JBrowse gene trap tracks? for additional information.
For gene trap alleles not associated with markers, click View all gene trap sequence tags within a 10kb region to access the JBrowse view. The display shows a 5kb region on either side of the gene trap sequence tag.
Top
How do I interpret JBrowse gene trap tracks?
GeneTraps_per_Gene track
This color-coded gene trap frequency track for MGI genes indicates the number of insertions per gene. Only insertions that "trap" a gene (i.e., overlap the gene's coordinates in the correct transcriptional orientation) are counted.
See also How can I tell if a given gene trap has trapped a gene? for additional information.)
Gene model color | Number of gene traps associated with the marker |
Red | More than 1000 |
Orange | 101 - 1000 |
Green | 11 - 100 |
Blue | 2 - 10 |
Violet | One |
Black | None |
GeneTraps_RNA_tags track
These gene trap sequence tags from GenBank dbGSS have an RNA-based sequence tag method (5'-RACE or 3'-RACE).
- RNA-based gene trap sequence tags are derived from the exon adjacent to the insertion site.
- Gene trap mutagenesis requires that the vector inserts in the correct transcriptional orientation of the "trapped" gene.
- Sequence tag direction indicates the strand of gene trap mutagenesis.
- Sequence tags that overlap known genes in the opposite transcriptional orientation do not disrupt those genes from a gene trapping mechanism and are not counted in this track.
Sequence tag method | Insertion site in the intron |
5'-RACE (blue) | Downstream from the most 3' exon represented in the sequence tag |
3'-RACE (green) | Upstream from the most 5' exon represented in the sequence tag |
GeneTraps_DNA_tags track
These gene trap sequence tags from GenBank dbGSS have a DNA-based sequence tag method.
- DNA-based gene trap sequence tags are derived from genomic DNA at the site of gene trap vector insertion.
- Gene trap mutagenesis requires that the vector inserts in the correct transcriptional orientation of the "trapped" gene.
- Sequence tag direction indicates the strand of gene trap mutagenesis.
- Sequence tags that overlap known genes in the opposite transcriptional orientation do not disrupt those genes from a gene trapping mechanism and are not counted in the GeneTraps_per_Gene track.
How can I tell if a given gene trap has trapped a gene?
If the MGI allele_detail page has an association for the gene, and in MGI JBrowse,
- the sequence tag location overlaps a gene and
- the direction of the arrow is the same as that of the transcript with respect to the gene model
then the gene trap has trapped a gene.
Top