Home  -   Genomes  -   Blat  -   Tables  -   Gene Sorter  -   PCR  -   Proteome  -   Session  -   FAQ  -   Help
 The Genome Browser Database

This page describes the format of the genome annotation databases that underlie the UCSC Genome Browser. These data were contributed by many researchers, as described on the Genome Browser Credits page. Please acknowledge the contributor(s) of the data you use. All tables for an assembly are freely usable for any purpose except as indicated in the README.txt file in the downloadable annotation database directory for that release.

NOTE: This page is no longer maintained. For complete up-to-date table descriptions, use the "describe table schema" button in the Table Browser.



  Table Descriptions

The data displayed in the Genome Browser are stored in a MySQL database. The tables in the database can be grouped into four categories:

  • tables in which the data has been split into a separate table for each chromosome
  • tables that contain information for all chromosomes
  • tables that contain information specifically related to mRNA sequences
  • tables intended primarily for internal use

All coordinates in these tables are half-open zero-based. This means that the first 100 bases of a chromosome are represented as [0,100), i.e. 0-99. The second 100 bases are represented as [100,200), i.e. 100-199, and so forth. An advantage of half-open coordinate ranges is that the length can be obtained by simply subtracting the start from the end.

Data that have been split into a separate table for each chromosome have the naming convention chrN_[table], where N is replaced by the chromosome name. In many cases a table has been added that contains the unordered part (random) of the chromosome, chrN_random_[table]. Finally, in addition to the standard chromosome names, some assemblies have two special chromosome names, UL and NA, representing fingerprint clone contigs that could not be placed confidently on a chromosome, and individual clones that could not be placed, respectively. The table names are hyperlinked to an autoSql format description of their fields.



  Tables Split by Chromosome

This section shows data that has been split into a separate table for each chromosome.

[chr6] [chrN_B] [chrN_C] [chrN_E] [chrN_G] [chrN_H] [chrN_I] [chrN_M] [chrN_R] [chrN_V] [chrN_W] [chrN_Z]


 Data Stored in a Single Table

The tables in this section contain data stored in a single table rather than in one table per chromosome.

[A]  [B]  [C]  [D]  [E]  [F]  [G]  [H]  [I]  [J]  [K]  [M]  [N]  [O]  [P]  [Q]  [R]  [S]  [T]  [U]  [V]  [W]  [X]  [Y]  [Z] 
  • a_gambiae_myp2a - Pairwise MAF alignment of A. gambiae to D. melanogaster
  • acembly - Acembly Gene Predictions With Alt-splicing
  • acemblyPep - Translations of Acembly gene predictions into corresponding amino acid sequences
  • affy10K - SNPs on the Affymetrix 10K SNP Genotyping Array
  • affy10KDetails - Affymetrix 10K SNP sequence details
  • affyZebrafish - Sequences used for probe selection on Affymetrix zebrafish chip
  • affyExps - Experiment descriptive data
  • affyGeno - SNPs on the Affymetrix 120K SNP Genotyping Array
  • affyGenoDetails - Affymetrix 120K SNP array information
  • affyGn1h - Sequences used for probe selection on Affy GNF1H chips
  • affyGnf1m - Sequences used for probe selection on Affy GNF1M chips
  • affyGnfU74A - Sequences used for probe selection on Affy GNFU74A chips
  • affyGnfU74B - Sequences used for probe selection on Affy GNFU74B chips
  • affyGnfU74C - Sequences used for probe selection on Affy GNFU74C chips
  • affyMOE430 - Sequences used for probe selection on Affy MOE430 chips
  • affyRAE230 - Sequences used for probe selection on Affy RAE230 chips
  • affyU34A - Sequences used for probe selection on Affy U34A chips
  • affyU74 - Sequences used for probe selection on Affy U74 chips
  • affyRatio - Expression data from GNF using Affymetrix GeneChips
  • affyTranscriptome - Transcriptome data for chr21 and chr22 from Affymetrix
  • affyU133 - Consensus and exemplar sequences used for probe selection on Affy HG-U133A and HG-U133B chips
  • affyU133Plus2 - Consensus and exemplar sequences used for probe selection on the Affy HG-U133 Plus 2.0 chip
  • affyU95 - Consensus and exemplar sequences used for probe selection on Affy HG-U95Av2 chip
  • affyZebrafish - Consensus and exemplar sequences used for probe selection on Affy Zebrafish chip
  • all_bacends - BLAT alignment information for BAC clone end sequences on the draft assembly sequence
  • all_est - EST alignments (concatenation of chr*_est)
  • all_fosends - Fosmid end alignments
  • all_mrna - mRNA alignments (concatenation of chr*_mrna)
  • all_sts_primer - BLAT alignment information for STS markers using primer sequences on the draft sequence assembly
  • all_sts_seq - BLAT alignment information for full STS markers sequences on the draft sequence assembly
  • altGraphX - Summary of alt-splicing in EST and mRNA tracks
  • anoGam1_050201 - Scored reference file for anoGam1 alignments
  • anophelesEcores - D. melanogaster (dm1)/A. gambiae evolutionary conserved regions (ecores) computed by Exofish
  • apiMel1_050201 - Scored reference file for apiMel1 alignments
  • arbFlyLifeAll - Expression data
  • arbFlyLifeAllExps - Experiment descriptive data
  • arbFlyLifeAllRatio - Expression data
  • arbFlyLifeMedian - Expression data
  • arbFlyLifeMedianExps - Experiment descriptive data
  • arbFlyLifeMedianRatio - Expression data
  • atlasOncoGene - Link to Atlas Oncology site
  • axtNetCb1 - Scored reference file for C. elegans/C. briggsae alignments
  • axtNetDp1 - Scored reference file for D. melanogaster/D. pseudoobscura alignments
  • axtNetHg16 - Scored reference file for human/chimp alignments
  • bacEndPairs - Locations of BAC clones using placements of its 5' and 3' end sequences. BAC end sequence information comes from the dbGSS portion of GenBank
  • bacEndSingles - Locations of BAC clones where only one end is mapped (Zebrafish)
  • bactigPos - Bactig positions in chromosome coordinates
  • bdgpBlastTab - Blastp results of known genes vs. one another (D. melanogaster)
  • bdgpCanonical - Describes the canonical splice variant of a gene (Drosophila)
  • bdgpGene - Berkeley Drosophila Genome Project genes (D. melanogaster)
  • bdgpNonCoding - Berkeley Drosophila Genome Project non-coding genes (D. melanogaster)
  • bdgpIsoforms - Links together various transcripts of a gene into a cluster (Drosophila)
  • bdgpToLocusLink - Associates known gene and LocusLink accession (Drosophila)
  • bdgpToPfam - Associates known gene and Pfam ID (Drosophila)
  • bdgpToRefSeq - Associates known gene and RefSeq accession (Drosophila)
  • bgiCov - Areas of chicken genomic coverage of 3 alternate chicken strains sequenced by Beijing Genomics Institute
  • bgiGene - Alignments of chicken protein-coding genes annotated by Beijing Genomics Institute
  • bgiSnp - Chicken SNP information from Beijing Genomics Institute
  • bioCycMapDesc - BioCyc pathway maps
  • bioCycPathway - Cross reference between BioCyc Pathway and Known Genes
  • blastDm1FB - tBLASTn alignments of peptides from D. melanogaster predicted and known genes in FlyBase
  • blastHg16KG - tBLASTn alignments of peptides from hg16 knownGene predicted and known genes
  • blastHg17KG - tBLASTn alignment of peptides from hg17 knownGene predicted and known genes
  • blastKGPep0* - tBLASTn alignments peptide sequences
  • blastKGRef0* - Associates a known gene ID with a gene name, position, and SwissProt name for tBLASTn protein tracks
  • blatChimp - Blat alignments of Pan troglodytes against the draft human genome
  • blatCioSav1 - Blat alignments of Ciona savignyi against the current genome
  • blatFr1 - Translated BLAT alignments of Takifugu rubripes assembly fr1 against the current genome
  • ceBlastTab - Blastp results of known genes vs. one another (C. elegans)
  • celeraCoverage - Summary of large genomic duplications from Celera data
  • celeraDupPositive - Summary of large genomic duplications from Celera data
  • celeraOverlay - Celera assembly overlay in the public NCBI assembly
  • cgapAlias - Associates CGAP pathway IDs with gene symbols or mRNA accessions
  • cgapBiocDesc - CGAP/BioCarta pathway description
  • cgapBiocPathway - Associates CGAP pathway IDs with BioCarta pathway IDs
  • chainLink - Alignment block in chain
  • chainRxBestPt0 - Summary info about a chain of chimp alignments
  • chainRxBestPt0Link - Alignment block in chimp chain
  • chicken_hmrg - Chicken scored reference file for human/chimp/mouse/rat/chicken alignments
  • chicken_netBlastz - Scored reference file for chicken alignments
  • chimpBac - Blat alignments of Pan troglodytes BACs against the draft human genome
  • chimpBlat - Blat alignments of Pan troglodytes against the draft human genome
  • chimpDels - Human genome assembly regions deleted in chimp
  • chimp_hmrg - Chimp scored reference file for human/chimp/mouse/rat/chicken alignments
  • chimp_netBlastz - Scored reference file for chimp alignments
  • chimpSimpleDiff - Simple differences between the chimp and human genome assemblies
  • choExpDistance - Distance between 2 genes in expression space (S. cerevisiae)
  • chromInfo - Describes name and size of each chromosome as well as an external file containing the chromosome
  • clonePos - Positions of all clones in golden path. Also HTG phase, finished/draft/predraft stage, and sequence size
  • contamination - Contamination information for current assembly
  • contigAcc - Maps a contig to its accession
  • cpgIsland - CpG islands
  • cpgIslandExt - CpG islands including observed/expected ratio
  • ctgPos - Name, size, and golden path position of all contigs
  • ctgPos2 - Name, size, golden path position and type of all contigs
  • cytoBand - 800 chromosome bands mapped to golden path by Terry Furey using fish data from BAC Resource Consortium
  • cytokines - Cytokine gene annotations (Tetraodon) fish data from BAC Resource Consortium
  • cytoBandIdeo - Information for mapping ideogram
  • d_pseudoobscura_myp2a - Pairwise MAF alignment of D. pseudoobscura to D. melanogaster
  • d_yakuba_myp2a - Pairwise MAF alignment of D. yakuba to D. melanogaster
  • danRer1_netBlastz - Scored reference file for zebrafish (danRer1) alignments
  • danRer2_netBlastz - Scored reference file for zebrafish (danRer2) alignments
  • dbSnpRS - Information from dbSNP at the reference SNP level
  • dbSnpRsHg - Human genome information from dbSNP at the reference SNP level
  • dbSnpRsMm - Mouse genome information from dbSNP at the reference SNP level
  • dbSnpRsRn - Rat genome information from dbSNP at the reference SNP level
  • dbtssAli - RefSeq mRNA extended to the 5' end from DBTSS (Database of Transcriptional Start Sites)
  • dmBlastTab - Blastp results of known genes vs. one another (D. melanogastor)
  • dog_netBlastz - Scored reference file for dog alignments
  • dp2_050201 - Scored reference file for dp2 alignments
  • drBlastTab - Blastp results of known genes vs. one another (Danio rerio)
  • droAna1_050201 - Scored reference file for droAna1 alignments
  • droMoj1_050201 - Scored reference file for droMoj1 alignments
  • droVir1_050201 - Scored reference file for droVir1 alignments
  • droYak1_050201 - Scored reference file for droYak1 alignments
  • dupSpMrna - Duplicate mRNA/Protein entries that have identical CDS structures
  • ECgene - ECGene gene predictions
  • ECgenePep - Translations of ECgene genes into corresponding amino acid sequences
  • ecoresFr1 - Human/Fugu evolutionary conserved regions (ecores) computed by Exofish
  • ecoresHg16 - Human (hg16)/Fugu evolutionary conserved regions (ecores) computed by Exofish
  • ecoresHg17 - Human (hg17)/Fugu evolutionary conserved regions (ecores) computed by Exofish
  • ecoresMm3 - Mouse (mm3)/Fugu evolutionary conserved regions (ecores) computed by Exofish
  • ecoresRn3 - Rat (rn3)/Fugu evolutionary conserved regions (ecores) computed by Exofish
  • ecoresTetNig1 - Tetraodon (tetNig1)/Human evolutionary conserved regions (ecores) computed by Exofish
  • ecoresTetraodon - Tetraodon/Human evolutionary conserved regions (ecores) computed by Exofish
  • encodeAffyChIpRnapHl50PvalHr[00,02,08,32] - Affymetrix ChIP-chip enrichment of Pol2 from retinoic acid stimulated HL60 cells at 0, 2, 8, or 32 hours, in ENCODE regions (4 different tracks)
  • encodeAffyChIpRnapHl60SitesHr[00,02,08,32] - Affymetrix ChIP-chip of Pol2 from retinoic acid stimulated HL60 cells at 0, 2, 8, or 32 hours, in ENCODE regions (4 different tracks)
  • encodeAffyRnaHl60SigHr[00,02,08,32] - Affymetrix PolyA+ RNA from retinoic acid stimulated HL60 cells at 0, 2, 8, or 32 hours, in ENCODE regions (4 different tracks)
  • encodeAffyRnaHl60SitesHr[00,02,08,32] - Affymetrix PolyA+ RNA abundance from retinoic acid stimulated HL60 cells at 0, 2, 8, or 32 hours, in ENCODE regions (4 different tracks)
  • encodeRegionInfo - Descriptive, assembly-independent information about ENCODE regions
  • encodeRegions - Names and positions of selected regions for the ENCODE project
  • encode_NHGRI_DNAseHS - DNaseI-hypersensitive sites in ENCODE regions
  • encode_Stanford_Promoters - Activity levels of predicted promoters in ENCODE regions (Stanford)
  • encode_UCSD_ChIP_RNAP_HCT116 - ChIP-chip (Pol II HCT116) in ENCODE regions (UCSD)
  • encode_UCSD_ChIP_RNAP_HELA - ChIP-chip (Pol II HeLa) in ENCODE regions (UCSD)
  • encode_UCSD_ChIP_RNAP_IMR90 - ChIP-chip (Pol II IMR90) in ENCODE regions (UCSD)
  • encode_UCSD_ChIP_RNAP_THP1 - ChIP-chip (Pol II THP1) in ENCODE regions (UCSD)
  • encode_UCSD_ChIP_TAF250_HCT116 - ChIP-chip (TAF1 HCT116) in ENCODE regions (UCSD)
  • encode_UCSD_ChIP_TAF250_HELA - ChIP-chip (TAF1 HeLa) in ENCODE regions (UCSD)
  • encode_UCSD_ChIP_TAF250_IMR90 - ChIP-chip (TAF1 IMR90) in ENCODE regions (UCSD)
  • encode_UCSD_ChIP_TAF250_THP1 - ChIP-chip (TAF1 THP1) in ENCODE regions (UCSD)
  • encode_UVA_DNARep_0_2 - Temporal profiling of DNA replication (0-2 hrs) in ENCODE regions (U. VA)
  • encode_UVA_DNARep_2_4 - Temporal profiling of DNA replication (2-4 hrs) in ENCODE regions (U. VA)
  • encode_UVA_DNARep_4_6 - Temporal profiling of DNA replication (4-6 hrs) in ENCODE regions (U. VA)
  • encode_UVA_DNARep_6_8 - Temporal profiling of DNA replication (6-8 hrs) in ENCODE regions (U. VA)
  • encode_UVA_DNARep_8_10 - Temporal profiling of DNA replication (8-10 hrs) in ENCODE regions (U. VA)
  • ensEstGene - Cross-reference info between transcript entries and translation entries of Ensembl genes
  • ensemblXref2 - Cross-reference info between transcript entries and translation entries of Ensembl genes (alternative table used in some assemblies)
  • ensemblXref3 - Cross-reference info between transcript entries and translation entries of Ensembl genes (alternative table used in some assemblies)
  • ensEst - Ensembl gene predictions based on ESTs
  • ensEstGene - Ensembl gene predictions based on ESTS
  • ensEstGtp - Associates Ensembl gene, transcript, and protein IDs
  • ensEstPep - Translations of Ensembl EST-based gene predictions into corresponding amino acid sequences
  • ensGene - Ensembl Gene Predictions
  • ensGeneXref - Table gene_xref downloaded from Ensembl
  • ensGtp - Associates Ensembl gene, transcript, and protein IDs
  • ensPep - Translations of Ensembl gene predictions into corresponding amino acid sequences
  • ensTranscript - Table transcript downloaded from Ensembl
  • esRegGeneToMotif - Eran Segal regulatory module predicted trancription factor binding sites
  • estOrientInfo - Extra information on ESTs calculated by polyInfo program
  • exoFish - Genescope Exofish Tetraodon evolutionarily conserved regions (Last used Aug. 2001)
  • exoMouse - Exonerate mouse/human evolutionarily conserved regions (Last used Apr. 2001)
  • exoniphy - Evolutionarily conserved protein-coding exons in multiple, aligned sequences identified by the ExoniPhy program
  • fbAllele - Alleles of a FlyBase gene
  • fbGene - Links FlyBase IDs, gene symbols and gene names
  • fbGo - Links FlyBase gene IDs and GO IDs/aspects
  • fbPhenotype - FlyBase observed phenotype in mutant
  • fbRef - FlyBase literature or database reference
  • fbRole - FlyBase role of gene in wildType
  • fbSynonym - Links FlyBase ID to gene synonyms in database
  • fbTranscript - Links FlyBase gene IDs and BDGP transcript IDs
  • flyBase2004Xref - Cross reference of FlyBase gene ID, symbolic gene name, synonyms, and accessions (circa late 2004)
  • flyBaseGene - FlyBase gene predictions
  • flyBasePep - FlyBase genes translated proteins
  • flyBaseNoncoding - FlyBase genes noncoding RNAs
  • flyBaseSwissProt - FlyBase acc to SwissProt acc, plus some other SwissProt info
  • flyreg - FlyReg data (D. melanogaster - dm1)
  • flyreg2 - FlyReg data (D. melanogaster - dm2)
  • flyregMotif - FlyReg gapless motif (D. melanogaster)
  • firstEF - First Exon Finder (FirstEF) exon and promoter predictions
  • fishClones - Clones placed on Cytogenetic Map using FISH
  • foldUtr3 - Info about folding of RNA into secondary structure
  • foldUtr5 - Info about folding of RNA into secondary structure
  • fosEndPairs - Positions of end pairs for fosmids
  • fugu_netBlastz - Scored reference file for Fugu alignments
  • galGal2_netBlastz - Scored reference file for chicken alignments
  • gaze - Genoscope GAZE gene annotations (Tetraodon)
  • gbProtAnn - GenBank protein annotations
  • gc5Base - Percentage of G or C bases in a 5-base window
  • gcPercent - Percentage of G or C bases in a 20,000-base window
  • geneBands - Band locations of known genes
  • geneid - Gene predictions from the geneid program
  • geneidPep - Translations of geneid gene predictions into its corresponding amino acid sequence
  • genieAlt - Genie gene predictions - all constrained by one degree or another by EST or mRNA
  • genieAltPep - Translations of Genie gene predictions into its corresponding amino acid sequence
  • genieKnown - Known genes processed by Genie to allow sequencing error (Oct. 2000 and earlier known gene track)
  • genieKnownPep - Translations of known genes into its corresponding amino acid sequence (Oct. 2000 and earlier known gene track)
  • genMapDb - BAC clones from GenMapDB placed on the assembly by U Penn (V. Cheung)
  • genomicSuperDups - Duplications of >1000 Bases Sequence
  • genscan - Genscan Gene Predictions
  • genscanPep - Translations of Genscan gene predictions into corresponding amino acid sequences
  • genscanSubopt - Genscan byproducts: good individual exons out of frame with main predictions (Last used Apr. 2001)
  • gnfAtlas2 - Expression data from the GNF Gene Expression Atlas 2
  • gnfAtlas2Distance - Describes the distance between genes in the expression space (GNF Atlas 2)
  • gnfHumanAtlas2All - Expression data
  • gnfHumanAtlas2AllExps - Experiment descriptive data
  • gnfHumanAtlas2AllRatio - Expression data
  • gnfHumanAtlas2Median - Expression data
  • gnfHumanAtlas2MedianExps - Experiment descriptive data
  • gnfHumanAtlas2MedianRatio - Expression data
  • gnfHumanU95All - Expression data
  • gnfHumanU95AllExps - Experiment descriptive data
  • gnfHumanU95AllRatio - Expression data
  • gnfHumanU95Median - Expression data
  • gnfHumanU95MedianExps - Experiment descriptive data
  • gnfHumanU95MedianRatio - Expression data
  • gnfMouseAtlas2All - Expression data
  • gnfMouseAtlas2AllExps - Experiment descriptive data
  • gnfMouseAtlas2AllRatio - Expression data
  • gnfMouseAtlas2Median - Expression data
  • gnfMouseAtlas2MedianExps - Experiment descriptive data
  • gnfMouseAtlas2MedianRatio - Expression data
  • gnfMouseU74aAll - Expression data
  • gnfMouseU74aAllExps - Experiment descriptive data
  • gnfMouseU74aAllRatio - Expression data
  • gnfMouseU74aMedian - Expression data
  • gnfMouseU74aMedianExps - Experiment descriptive data
  • gnfMouseU74aMedianRatio - Expression data
  • gnfMouseU74bAll - Expression data
  • gnfMouseU74bAllExps - Experiment descriptive data
  • gnfMouseU74bAllRatio - Expression data
  • gnfMouseU74bMedian - Expression data
  • gnfMouseU74bMedianExps - Experiment descriptive data
  • gnfMouseU74bMedianRatio - Expression data
  • gnfMouseU74cAll - Expression data
  • gnfMouseU74cAllExps - Experiment descriptive data
  • gnfMouseU74cAllRatio - Expression data
  • gnfMouseU74cMedian - Expression data
  • gnfMouseU74cMedianExps - Experiment descriptive data
  • gnfMouseU74cMedianRatio - Expression data
  • gnfRatAtlas2All - Expression data
  • gnfRatAtlas2AllExps - Experiment descriptive data
  • gnfRatAtlas2AllRatio - Expression data
  • gnfRatAtlas2Median - Expression data
  • gnfRatAtlas2MedianExps - Experiment descriptive data
  • gnfRatAtlas2MedianRatio - Expression data
  • gnfU95Distance - Describes the distance between genes in the expression space (GNF U95)
  • HInv - Full-length cDNAs from the H-Invitational Human Gene Annotation Database
  • HInvGeneMrna - H-Invitational genes mRNA data
  • haplotype - Common Haplotype Blocks - U. of Oxford and Sanger Institute (chr22 only)
  • hg17_netBlastz - Scored reference file for human hg17 alignments
  • hgFindSpec - Information that defines a search performed by hgFind binary
  • hoxGenes - HOX gene mRNAs
  • hugo - Cross reference between hugo and other databases
  • humanChain - Summary info about chain of human alignments
  • human_chm - Pairwise multiple alignments between current species and human
  • humanDels - Human deletions in chimp genome
  • humanNet - Database representation of a net of alignments with human
  • humorMm3Rn3 - Human, Mouse, and Rat multiz alignments for hg16, mm3, and rn3 (human only)
  • imageClone - Used with image consortium's cumulative_plate files at: ftp://image.llnl.gov/image/outgoing"
  • interProXref - Associates InterPro ID with Swiss-Prot accessions
  • jaxOrtholog - Jackson Lab Mouse Orthologs
  • jaxQTL - Jackson Lab Mouse Quantitative Trait Loci
  • jgiGene - Alignments of JGI predicted transcripts to the current genome
  • keggMapDesc - Description of KEGG pathway map
  • keggPathway - Associates KEGG pathway IDs with Known Genes and LocusLink IDs
  • kgAlias - Links together a Known Gene ID and a gene alias
  • kgProtAlias - Links together a Known Gene ID and a protein alias
  • kgProtMap - Known Genes protein mapping
  • kgSpAlias - Links together a Known Gene ID, Swiss-Prot/TrEMBL accession and an alias
  • kgXref - Links together a Known Gene ID and mRNA, SwissProt, RefSeq, and NCBI accessions/IDs
  • kimWormLifeAllExps - Experiment descriptive data
  • kimWormLifeAllRatio - Expression data
  • kimWormLifeMedianExps - Experiment descriptive data
  • kimWormLifeMedianRatio - Expression data
  • knownBlastTab - Blastp results of known genes vs. one another (human)
  • knownCanonical - Describes the canonical splice variant of a gene
  • knownExpDistance - Describes the distance between genes in the expression space (UCLA)
  • knownGene - Protein coding genes based on proteins from SWISS-PROT, TrEMBL, and TrEMBL-NEW and their corresponding mRNAs from GenBank
  • knownGeneLink - Stores the seqType info for Known Genes track entries.
  • knownGeneMrna - Known Genes mRNA
  • knownGenePep - Known Genes translated proteins
  • knownInfo - Other information about known genes (Oct. 2000 and earlier known gene track)
  • knownIsoforms - Links together various transcripts of a gene into a cluster
  • knownMore - Links to other databases: GenBank, OMIN, HUGO, RefSeq, GBD (Oct. 2000 and earlier known (RefSeq) gene track)
  • knownToCdsSnp - Associates known gene and coding SNP IDs
  • knownToEnsembl - Associates known gene and Ensembl transcripts
  • knownToGnfAtlas2 - Associates known gene and Atlas2 expression data
  • knownToGnf1h - Associates known gene and GNF1H expression data
  • knownToHInv - Associates known gene and HInv transcripts
  • knownToLocusLink - Associates known gene and LocusLink ID
  • knownToMOE430 - Associates known gene and Affy MOE430 expression data
  • knownToMOE430A - Associates known gene and Affy MOE430A expression data
  • knownToPfam - Associates known gene and Pfam ID
  • knownToRefSeq - Associates known gene and RefSeq accession
  • knownToSuper - Associates known gene and Superfamily ID
  • knownToU133 - Associates known gene and Affy U133 chip ID
  • knownToU133Plus2 - Associates known gene and Affy U133 Plus 2.0 chip ID
  • knownToU133 - Associates known gene and Affy U133 chip ID
  • knownToU74 - Associates known gene and Affy U74 (mouse) chip ID
  • knownToU95 - Associates known gene and Affy U95 chip ID
  • mgcFullMrna - Mammalian Gene Collection (MGC) full ORF mRNAs
  • mgcFullStatus - Status of MGC clones
  • mgcGenes - MGC full ORF mRNAs
  • MGI ID - Links MGI ID to Locus Link ID (Mouse)
  • miRNA - MicroRNAs from the miRNA Registry at the WellCome Trust Sanger Institute.
  • mm5_netBlastz - Scored reference file for mouse mm5 alignments
  • mmBlastTab - Blastp results of known genes vs. one another (mouse)
  • mostConserved - BED files of most conserved data.
  • mouseChain - Summary info about chain of mouse alignments
  • mouse_chm - Pairwise multiple alignments between current species and mouse
  • mouseChrom - Describes name and size of each mouse chromosome as well as an external file containing the chromosome
  • mouse_hmrg - Mouse scored reference file for human/chimp/mouse/rat/chicken alignments
  • mouseNet - Database representation of a net of alignments with mouse
  • mouse_netBlastz - Scored reference file for mouse alignments
  • mouseRinnSex - Rinn et al. sex gene expression data on MOE430A chip
  • mouseRinnSexExps - Rinn et al. sex gene experiment descriptive data on MOE430A chip
  • mouseRinnSexMedian - Rinn et al. sex gene expression data on MOE430A chip
  • mouseRinnSexMedianExps - Rinn et al. sex gene experiment descriptive data on MOE430A chip
  • mouseRinnSexMedianRatio - Rinn et al. sex gene expression data on MOE430A chip
  • mouseRinnSexRatio - Rinn et al. sex gene expression data on MOE430A chip
  • mouseSyn - Synteny between mouse and human chromosomes (Last used Apr. 2001)
  • mouseSynWhd - Whitehead synteny between mouse and human chromosomes
  • mrnaOrientInfo - Extra information on mRNAs calcluated by polyInfo program
  • mrnaRefseq - Associates mRNA IDs with RefSeq IDs
  • multiz5way - Index of multiz multiple alignments of 5 organisms. NOTE: To obtain the conservation scores associated with this assembly, download the data from the assembly's phastCons directory on the Genome Browser FTP server.
  • multiz8way - Index of multiz multiple alignments of 8 organisms. NOTE: To obtain the conservation scores associated with this assembly, download the data from the assembly's phastCons directory on the Genome Browser FTP server.
  • multiz8wayCons - "Wiggle track" display of evolutionary conservation in 8 organisms based on a phylogenetic hidden Markov model, using multiz alignments. NOTE: To obtain the conservation scores associated with this assembly, download the data from the assembly's phastCons directory on the Genome Browser FTP server.
  • mzPt1Mm3Rn3Gg2_pHMM - Index of multiz multiple alignments of human, chimp, mouse, rat and chicken. NOTE: To obtain the conservation scores associated with this assembly, download the data from the assembly's phastCons directory on the Genome Browser FTP server.
  • mzPt1Mm3Rn3Gg2_pHMM_wig - "Wiggle track" display of evolutionary conservation in human, chimp, mouse, rat, and chicken based on a phylogenetic hidden Markov model, using multiz alignments. NOTE: To obtain the conservation scores associated with this assembly, download the data from the assembly's phastCons directory on the Genome Browser FTP server.
  • multizHg15Mm3 - Multiz multiple alignments between human (hg15) and mouse (mm3)
  • multizMm3Rn2 - Multiz multiple alignments between mouse (mm3) and rat (rn2)
  • multizYeast - Multiz multiple alignments of 7 yeast species against S. cerevisiae (sc1)
  • mzDy1Dp2Ag1_phast - Index of multiz multiple alignments of D. melanogaster, D. pseudoobscura, and A. gambiae NOTE: To obtain the conservation scores associated with this assembly, download the data from the assembly's phastCons directory on the Genome Browser FTP server.
  • mzDy1Dp2Ag1_phast_wig - "Wiggle track" display of evolutionary conservation in D. melanogaster, D. pseudoobscura, and A. gambiae based on a phylogenetic hidden Markov model, using multiz alignments. NOTE: To obtain the conservation scores associated with this assembly, download the data from the assembly's phastCons directory on the Genome Browser FTP server.
  • mzHg17Mm5_phast - Evolutionary conservation in current species, human, and mouse based on a phylogenetic hidden Markov model, using multiz alignments. NOTE: To obtain the conservation scores associated with this assembly, download the data from the assembly's phastCons directory on the Genome Browser FTP server.
  • mzPt1Mm3Rn3Gg2_pHMM_wig - "Wiggle track" display of evolutionary conservation in current species, human, and mouse based on a phylogenetic hidden Markov model, using multiz alignments. NOTE: To obtain the conservation scores associated with this assembly, download the data from the assembly's phastCons directory on the Genome Browser FTP server.
  • nci60 - Microarray Experiments for NCI 60 Cell Lines
  • nci60Exps - Experiment descriptive data
  • netAnoGam1 - Database representation of a net of alignments with A. gambiae anoGam1 assembly
  • netCanFam1 - Database representation of a net of alignments with dog assembly canFam1
  • netCb1 - Database representation of a net of alignments with C. briggsae assembly cb1
  • netDanRer1 - Database representation of a net of alignments with zebrafish assembly danRer1
  • netDanRer2 - Database representation of a net of alignments with zebrafish assembly danRer2
  • netDm1 - Database representation of a net of alignments with D. melanogaster assembly dm1
  • netDp1 - Database representation of a net of alignments with D. pseudoobscura assembly dp1
  • netDp2 - Database representation of a net of alignments with D. pseudoobscura assembly dp2
  • netDroYak1 - Database representation of a net of alignments with D. yakuba assembly droYak1
  • netFr1 - Database representation of a net of alignments with Fugu assembly fr1
  • netHg15 - Database representation of a net of alignments with Huamn assembly hg15
  • netHg16 - Database representation of a net of alignments with Huamn assembly hg16
  • netHg17 - Database representation of a net of alignments with Huamn assembly hg17
  • netGalGal2 - Database representation of a net of alignments with chicken assembly galGal2
  • netMm3 - Database representation of a net of alignments with mouse assembly mm3
  • netMm4 - Database representation of a net of alignments with mouse assembly mm4
  • netMm5 - Database representation of a net of alignments with mouse assembly mm5
  • netMonDom1 - Database representation of a net of alignments with opossum assembly monDom1
  • netPanTro1 - Database representation of a net of alignments with chimp assembly panTro1
  • netRn3 - Database representation of a net of alignments with rat assembly rn3
  • netRxBestPt0 - Database representation of a net of alignments with chimp assembly pt0
  • netSelf - Database representation of a net of alignments of assembly with itself
  • netSyntenyDm1 - Database representation of a net of alignments with D. melanogaster assembly dm1
  • netSyntenyHg16 - Database representation of a net of alignments with human assembly hg16
  • netTetNig1 - Database representation of a net of alignments with Tetraodon assembly tetNig1
  • netXenTro1 - Database representation of a net of alignments with X. tropicalis assembly xenTro1
  • NIAGene - Alignments of the NIA Mouse Gene Index (v.3) with the selected mouse assembly
  • pbAaDistX - Proteome Browser distribution for a specific amino acid X
  • pbAnomLimit - Proteome Browser amino acid anomaly limits for each AA
  • pbResAvgStd - Proteome Browser residue average and standard deviation
  • pbStamp - Proteome Browser stamp information
  • pdbSP - Links a Swiss-Prot accession ID with a PDB ID
  • pepCCntDist - Proteome Browser cysteine count distribution
  • pepExonCntDist - Proteome Browser exon count distribution
  • pepHydroDist - Proteome Browser hydrophobicity distribution
  • pepIPCntDist - Proteome Browser InterProt domain count distribution
  • pepMolWtDist - Proteome Browser molecular weight distribution
  • pepMwAa - Proteome Browser molecular weight and AA length of proteins
  • pepPi - Proteome Browser protein pIs
  • pepPiDist - Proteome Browser pI distribution
  • pepPred - Proteome Browser predicted peptide linked to predicted gene
  • pepResDist - Proteome Browser residue distribution
  • perlegen - Perlegen Common High-Resolution Haplotype Blocks (chr21 only)
  • pfamDesc - Description of Pfam table
  • pfamXref - Links Pfam and Swiss-Prot IDs
  • pGC - Percentage of bases that are G or C in small regions across the genome
  • phastCons - "Wiggle track" display of evolutionary conservation based on a phylogenetic hidden Markov model, using multiz alignments
  • phastCons5 - "Wiggle track" display of evolutionary conservation in 4 organisms based on a phylogenetic hidden Markov model, using multiz alignments
  • phastCons8way - "Wiggle track" display of evolutionary conservation in 8 organisms based on a phylogenetic hidden Markov model, using multiz alignments
  • phastConsElements - Predictions of conserved elements produced by phastCons program
  • phastConsElements8way - Predictions of conserved elements in 8 organisms produced by phastCons program
  • phMouse - Pattern Hunter Mouse/Human Evolutionarily Conserved Regions (Last used Apr. 2001)
  • pjt_gene - NISC-generated gene annotations for zoo
  • protHomolog - SAM homolog data
  • pscreen - P-screen (BDGP Gene Disruption Project) P element insertion locations and genes
  • pseudoGeneLink - Links between predicted gene/pseudogene and ortholog/paralog information
  • pseudoMrna - Predicted pseudogene alignment information
  • pseudoYale - Alignments of pseudogenes identified in the Yale Pseudogene Database
  • quality - Genome quality scores
  • rankProp - RankProp protein ranking for a pair of proteins
  • rat_hmrg - Rat scored reference file for human/chimp/mouse/rat/chicken alignments
  • rat_netBlastz - Scored reference file for rat alignments
  • rBestNetPanTro1 - Database representation of a reciprocal best net of alignments with chimp assembly panTro1
  • recombRate - Describes the recombination rate in 1Mb intervals based on deCODE, Genethon and Marshfield maps
  • recombRateRat - Describes rat recombination rate based on SHRSPxBN F2 intercross or FHHxACI F2 intercross genetic maps.
  • refFlat - RefSeq genes, with additional name (as it appears in Genome Browser) field
  • refGene - RefSeq genes
  • refKnown - Known genes as derived from alignments of RefSeq mRNA (Dec 2000 and later known (RefSeq) gene track)
  • refLink - Relates RefSeq mRNA accession to LocusLink ID, HUGO Gene Nomenclature Committee symbol, etc. (Dec. 2000 and later)
  • refMrna - RefSeq mRNA (Dec. 2000 and later, not present in incrementally-updated assemblies)
  • refPep - RefSeq translated proteins (Dec. 2000 and later, not present in incrementally-updated assemblies)
  • refSeqAli - RefSeq translated proteins (Dec. 2001 and later)
  • refSeqStatus - Associates RefSeq mRNA accessions with corresponding RefSeq status (Dec. 2001 and later)
  • refSeqSummary - Sparse summary table created only for refSeqs that have "Summary:" or "COMPLETENESS:" in comment
  • regPotential2X - 2-way regulatory potential score computed from human/mouse alignments
  • regPotential3X - 3-way regulatory potential score computed from human/mouse/rat alignments
  • rgdEst - Rat Genome Database ESTs
  • rgdGene - Rat Genome Database genes
  • rgdQtl - Rat Genome Database QTLs
  • rgdSslp - Rat Genome Database simple sequence length polymorphisms (SSLPs)
  • rhMap - Alignments of radiation hybrid map sequences to the assembly
  • rikenaltid - Riken altid table (Mouse Feb. 2002 and later)
  • rikenann - Riken annotation table (Mouse Feb. 2002 and later)
  • rikencluster - Riken cluster table (Mouse Feb. 2002 and later)
  • rikenclusterseq - Riken cluster sequence regulation table (Mouse Feb. 2002 and later)
  • rikenseq - Riken sequence table (Mouse Feb. 2002 and later)
  • rinnSex - Rinn et al. sex gene expression data on MOE430A chip
  • rnaCluster - Gene boundaries deduced from clustering spliced ESTs and mRNAs against the genome (Apr. 2002 and later)
  • rnaGene - Functional RNA genes (Last used Apr. 2001)
  • rosetta - Rosetta Experimental Confirmation of Chr22 Exons
  • rosettaExps - Experiment descriptive data
  • rosettaExps - Rosetta Experimental Confirmation information
  • sage - Stores SAGE data in terms of UniGene identifiers
  • sacBay_pwMaf - Scored reference file for S. bayanus alignments
  • sacCas_pwMaf - Scored reference file for S. castelli alignments
  • sacKlu_pwMaf - Scored reference file for S. kluyveri alignments
  • sacKud_pwMaf - Scored reference file for S. kudriavzevii alignments
  • sacKud_pwMik - Scored reference file for S. mikatae alignments
  • sacKud_pwPar - Scored reference file for S. paradoxus alignments
  • sageExp - Data related to SAGE experiments, tissue descriptions, etc.
  • samSubdir - Associates SAM data subdirectory with Known Gene ID
  • sanger20 - Sanger Institute Chromosome 20 Genes
  • sanger20extra - Additional information about a Sanger 20 or 22 gene
  • sanger22 - Sanger Institute Chromosome 22 Genes
  • sanger22extra - Additional information about a Sanger 20 or 22 gene
  • sanger22pep - Translations of Sanger22 gene predictions into corresponding amino acid sequences
  • sanger22pseudo - Sanger Institute Chromosome 22 Pseudogenes
  • sangerBlastTab - Blastp results of Sanger genes vs. one another (C. elegans)
  • sangerCanonical - Describes the canonical splice variant of a Sanger gene (C. elegans)
  • sangerGene - Sanger gene predictions, containing protein ID (C. elegans)
  • sangerGenefinder - Sanger gene predictions, containing protein ID (C. elegans)
  • sangerIsoforms - Links together various transcripts of Sanger gene into a cluster (C. elegans)
  • sangerToKim - Associates Sanger IDs and Kim Lab expression IDs (which are mostly - but not all - the same) (C. elegans)
  • sangerToPfam - Associates Sanger IDs and Pfam ID (C. elegans)
  • sangerToRefSeq - Associates Sanger IDs and RefSeq accession (C. elegans)
  • scopDes - Structural Classification of Proteins description. See Lo Conte, Brenner et al. NAR 2002"
  • seq - Locations of database table sequences referenced in external fasta files
  • sfAssign - Superfamily assignment table
  • sfDes - Superfamily description table
  • sfDescription - Stores domain descriptions for the Superfamily track
  • sgdAbundance - Protein abundance data from http://yeastgfp.ucsf.edu via SGD (S. cerevisiae)
  • sgdBlastTab - Blastp results of SGD genes vs. one another (S. cerevisiae)
  • sgdCanonical - Describes the canonical splice variant of a gene (S. cerevisiae)
  • sgdClone - Clone information from the Saccharomyces Genome Database (S. cerevisiae)
  • sgdGene - Saccharomyces Genome Database gene predictions (S. cerevisiae)
  • sgdIsoforms - Links together various transcripts of a gene into a cluster (S. cerevisiae)
  • sgdLocalization - Associates known gene and SGD accession (S. cerevisiae)
  • sgdOther - Other features from Saccharomyces Genome Database (S. cerevisiae)
  • sgdOtherDescription - More features from Saccharomyces Genome Database (S. cerevisiae)
  • sgdPep - Translations of SGD gene predictions into corresponding amino acid sequences (S. cerevisiae)
  • sgdToName - Associates SGD gene name and mRNA (S. cerevisiae)
  • sgdToPfam - Associates SGD ID and Pfam ID (S. cerevisiae)
  • sgdToSgd - Associates SGD IDs (S. cerevisiae)
  • sgdToSwissProt - Associates SGD IDs and Swiss-Prot IDs (S. cerevisiae)
  • sgpGene - SGP gene predictions (GRIB)
  • sgpPep - Translations of SGP gene predictions into corresponding amino acid sequences
  • simpleRepeat - Simple tandem repeats generated with the TRF program
  • slamHuman - Slam predictions of coding exons in homologous DNA sequences (human genome)
  • slamHumanNonCoding - Slam predictions of conserved noncoding regions in homologous DNA sequences (human genome)
  • slamMouse - Slam predictions of coding exons in homologous DNA sequences (mouse genome)
  • slamMouseNonCoding - Slam predictions of conserved noncoding regions in homologous DNA sequences (mouse genome)
  • slamRat - Slam predictions of coding exons in homologous DNA sequences (rat genome)
  • slamRatNonCoding - Slam predictions of conserved noncoding regions in homologous DNA sequences (rat genome)
  • snapGene - Alignments of SNAP genes (Semi-HMM-based Nucleic Acid Parser) against the current genome
  • snp - SNP positions from various sources (hg17 human assemblies and later)
  • snpExceptions - Contains a set of queries to look for potentially problematic SNPs
  • snpMap - SNP positions from various sources (hg13, hg15, and hg16 human assemblies)
  • snpNih - Locations of SNPs detected by aligning reads from random genomic clones against the genome (replaced by snpMap starting with hg13)
  • snpTsc - Locations of SNPs detected primarily by looking at overlaps between clones that cover the same region (replaced by snpMap starting with hg13)
  • softberryGene - Fgenesh++ Gene Predictions
  • softberryHom - String with GenBank gi and accession
  • softberryPep - Translations of Fgenesh++ gene predictions into corresponding amino acid sequences
  • spDisease - Associates SwissProt IDs with a disease description
  • spMrna - Associates SwissProt IDs with mRNA IDs
  • spOrganism - Associates SwissProt IDs with a taxonomy number
  • spPsiBlast - PSI-BLAST E-value for 2 proteins
  • spSecondaryID - Associates SwissProt IDs with secondary accession IDs
  • spXref2 - Associates SwissProt IDs with other databases
  • stsAlias - STS marker aliases and associated identification numbers
  • stsInfo - Constant STS marker information
  • stsInfo2 - Constant STS marker information - revised version used in later releases
  • stsInfoMouse - Constant STS marker information (MGI)
  • stsInfoMouseNew - Constant STS marker information (NCBI)
  • stsInfoRat - Constant STS marker information - rat assembly
  • stsMap - STS markers on genetic (blue) and radiation hybrid (black) maps
  • stsMapMouse - STS markers and postions on mouse assembly (MGI)
  • stsMapMouseNew - STS markers and postions on mouse assembly (NCBI)
  • stsMapRat - STS markers and postions on rat assembly
  • stsMarker - STS marker mappings from various maps and FISH clone mappings from the BAC Resource Consortium (obsolete as of Apr. 2001, replaced by stsMap)
  • supercontig - Locations of supercontigs from WGS assembly
  • superfamily - Superfamily proteins
  • swInterPro - Associates Swiss-Prot accession IDs with InterPro IDs
  • syntenyHg16 - Human synteny alignnments (hg16)
  • syntenyHuman - Human synteny alignnments from blastz single coverage
  • syntenyMm3 - Synteny between mouse (mm3) and human chromosomes
  • syntenyMouse - Synteny between mouse and human chromosomes (First used in hg13)
  • syntenyRat - Synteny between rat and human chromosomes (First used in hg13)
  • tfbsCons - Transcription factor binding sites conserved in a multiple species alignment
  • tfbsConsMap - Transcription factor binding sites conservation map data
  • tigrGeneIndex - Alignments of the TIGR Gene Index against the selected genome
  • tilingPath - A tiling path of clones through a chromosome
  • transRegCode - Transcription factor binding sites from CHIP/CHIP experiments and conservation
  • transRegCodeCondition - Associates a transcription factor with a growth condition
  • transRegCodeMotif - Transcription factor binding motif according to Harbison Gordon et al.
  • transRegCodeProbe - CHIP/CHIP probe and trranscription factor binding info
  • twinscan - TwinScan Gene Predictions
  • twinscanPep - Translations of Twinscan gene predictions into corresponding amino acid sequences
  • uniGene - UniGene alignments and SAGE Info (Last used Apr. 2001)
  • uniGene_2 - UniGene alignments and SAGE Info (new version)
  • uniGene_dr - Zebrafish UniGene alignments
  • uniGene_gg - Chicken UniGene alignments
  • vegaGene - Annotated genes from the Vertebrate Genome Annotation (VEGA) database (Human chr14, 20, 22 only)
  • vegaInfo - Additional information for Vega genes (Human chr14, 20, 22 only)
  • vegaPep - Translations of Vega genes into corresponding amino acid sequences (Human chr14, 20, 22 only)
  • vegaPseudoGene - Annotated pseudogenes and immunoglobulin segments from the VEGA database (Human chr14, 20, 22 only)
  • vntr - Microsatellites from Gerome Breen's VNTR program
  • wabaCbr - C. briggsae WABA alignments - full description
  • wgRna - CD and H/ACA Box snoRNAs and microRNAs from Weber and Griffiths-Jones
  • wz_ests - Alignments of WZ ESTs from WUSTL to the zebrafish assembly
  • xenoEst - Nonhuman ESTs from GenBank
  • xenoMrna - Nonhuman mRNAs from GenBank
  • xenoRefFlat - RefSeq genes from species other than current species, with additional name (as it appears in Genome Browser) field
  • xenoRefGene - RefSeq genes from species other than current species
  • xenoRefSeqAli - RefSeq translated proteins for species other than current species
  • yeastChoCellCycle - Expression data
  • yeastChoCellCycleExps - Experiment descriptive data
  • yeastChoCellCycleRatio - Expression data
  • zebrafish_netBlastz - Scored reference file for zebrafish alignments
  • zoom1_affyTranscriptome - Affymetrix Transcriptome scores (large scale)
  • zoom2_affyTranscriptome - Affymetrix Transcriptome scores (medium scale)


 mRNA Sequence Tables

The tables in this section contain additional information about mRNA sequences.

  • author - List of authors
  • bacClone - Name of the Bacterial Artificial Chromosome clone that a sequence was grown in, if applicable (Last used Aug. 2001)
  • cds - Coding sequence in GenBank format
  • cell - Cell types
  • center - Sequencing center (Last used Aug. 2001)
  • chromosome - Chromosome ID and name (Last used Aug. 2001)
  • cytoMap - Position on a cytological map (Last used Aug. 2001)
  • description - Short (one line) description
  • development - Stage of development of organism
  • gbCdnaInfo - Very relational mRNA and EST information table. Consists mostly of the mRNA accession and pointers into other tables that contain additional mRNA information.
  • geneName - Name of gene
  • keyword - Associated keywords in GenBank
  • library - Clone library (mostly for ESTs)
  • moddate - Last modification date
  • mrnaClone - Clone name (mostly for ESTs)
  • organism - Name of organism
  • productName - Protein product name
  • sex - Sex of organism
  • source - Source field in GenBank, often fairly redundant with other fields
  • tissue - Tissue isolated from


 Internal Use Tables

The tables in this section are intended primarily for internal use.

  • axtInfo - Axt alignment names and sizes
  • certificate - Table for non-standard join certificates
  • extFile - References to external files, again only used by RNA.
  • gbExtFile - References to external GenBank files.
  • gbLoaded - GenBank incremental update information.
  • grp - Track group types in browser display window
  • history - Modification history of database, only used by RNA programs
  • tableDescriptions - Descriptive information about database tables form autoSQL/gbdDescriptions
  • trackDb - Describes an annotation track


 autoSql Descriptions of Tables

This section describes the format of each table in autoSql format.

Affymetrix 10K SNP Details

table affy10KDetails
"Information representing the Affymetrix 10K Mapping Array"
(       
    string     affyId;    "Affymetrix SNP id"
    string     rsId;      "RS identifier (some are null)"
    string     tscId;     "TSC identifier (some are null)"
    char[2]    baseA;     "The first allele (A)"
    char[2]    baseB;     "The second allele (B)"
    char[34]   sequenceA; "The A allele with flanking sequence"
    char[34]   sequenceB; "The B allele with flanking sequence"
    char[8]    enzyme;    "The enzyme that was used to prepare the sample (HindIII
			   or XbaI)"
)

Affymetrix 120K SNP Array Information

table affyGenoDetails
"Information representing the Affymetrix 120K SNP array"
(       
    uint       affyId;    "Affymetrix SNP id"
    uint       rsId;      "RS identifier (some are null)"
    char[2]    baseA;     "The first allele (A)"
    char[2]    baseB;     "The second allele (B)"
    char[34]   sequenceA; "The A allele with flanking sequence"
    char[34]   sequenceB; "The B allele with flanking sequence"
    char[8]    enzyme;    "The enzyme that was used to prepare the sample (HindIII
			   or XbaI)"
    float      minFreq;   "The minimum allele frequency"
    float      hetzyg;    "The heterozygosity from all observations"
    float      avHetSE;   "The Standard Error for the average heterozygosity (not
			   used)"
    char[2]    NA04477;   "Individual 01"
    char[2]    NA04479;   "Individual 02"
    char[2]    NA04846;   "Individual 03"
    char[2]    NA11036;   "Individual 04"
    char[2]    NA11038;   "Individual 05"
    char[2]    NA13056;   "Individual 06"
    char[2]    NA17011;   "Individual 07"
    char[2]    NA17012;   "Individual 08"
    char[2]    NA17013;   "Individual 09"
    char[2]    NA17014;   "Individual 10"
    char[2]    NA17015;   "Individual 11"
    char[2]    NA17016;   "Individual 12"
    char[2]    NA17101;   "Individual 13"
    char[2]    NA17102;   "Individual 14"
    char[2]    NA17103;   "Individual 15"
    char[2]    NA17104;   "Individual 16"
    char[2]    NA17105;   "Individual 17"
    char[2]    NA17106;   "Individual 18"
    char[2]    NA17201;   "Individual 19"
    char[2]    NA17202;   "Individual 20"
    char[2]    NA17203;   "Individual 21"
    char[2]    NA17204;   "Individual 22"
    char[2]    NA17205;   "Individual 23"
    char[2]    NA17206;   "Individual 24"
    char[2]    NA17207;   "Individual 25"
    char[2]    NA17208;   "Individual 26"
    char[2]    NA17210;   "Individual 27"
    char[2]    NA17211;   "Individual 28"
    char[2]    NA17212;   "Individual 29"
    char[2]    NA17213;   "Individual 30"
    char[2]    PD01;      "Individual 31"
    char[2]    PD02;      "Individual 32"
    char[2]    PD03;      "Individual 33"
    char[2]    PD04;      "Individual 34"
    char[2]    PD05;      "Individual 35"
    char[2]    PD06;      "Individual 36"
    char[2]    PD07;      "Individual 37"
    char[2]    PD08;      "Individual 38"
    char[2]    PD09;      "Individual 39"
    char[2]    PD10;      "Individual 40"
    char[2]    PD11;      "Individual 41"
    char[2]    PD12;      "Individual 42"
    char[2]    PD13;      "Individual 43"
    char[2]    PD14;      "Individual 44"
    char[2]    PD15;      "Individual 45"
    char[2]    PD16;      "Individual 46"
    char[2]    PD17;      "Individual 47"
    char[2]    PD18;      "Individual 48"
    char[2]    PD19;      "Individual 49"
    char[2]    PD20;      "Individual 50"
    char[2]    PD21;      "Individual 51"
    char[2]    PD22;      "Individual 52"
    char[2]    PD23;      "Individual 53"
    char[2]    PD24;      "Individual 54"
)

Affymetrix Transcriptome

table affyTranscriptome
"Describes (x,y) pairs of samples"
    ushort bin;                         "A field to speed indexing" 
    string  chrom;                      "Chromosome alignment is on"
    uint    chromStart;                 "Start position in chromosome"
    uint    chromEnd;                   "End position in chromosome"
    string  name;                       "Unique index given to each block of samples"
    uint    score;                      "Overall block score from 0 to 1000"
    char[1] strand;                     "Strand: + or -"
    uint    sampleCount;                "Number of samples in this block"
    uint[sampleCount] samplePosition;   "Base position of each sample in this
                                         block, relative to chromStart"
    uint[sampleCount] sampleHeight;     "Score for each sample (y-value) from 0 to
					 1000"
   )

Alternative Splicing

table altGraphX
"An alternatively-spliced gene graph"
    (
    string tName;                   	"Name of target sequence, often a chrom"
    int tStart;                     	"First bac touched by graph"
    int tEnd;                       	"Start position in first bac"
    string name;                    	"Human readable name"
    uint id;                        	"Unique ID"
    char[2] strand;                 	"+ or - strand"
    uint vertexCount;               	"Number of vertices in graph"
    ubyte[vertexCount] vTypes;      	"Type for each vertex"
    int[vertexCount] vPositions;    	"Position in target for each vertex"
    uint edgeCount;                 	"Number of edges in graph"
    int[edgeCount] edgeStarts;      	"Array with start vertex of edges"
    int[edgeCount] edgeEnds;        	"Array with end vertex of edges"
    table evidence[edgeCount] evidence; "Array of evidence tables containing references 
				  	 to mRNAs that support a particular edge"
    int[edgeCount] edgeTypes;       	"Type for each edge, ggExon, ggIntron, etc"
    int mrnaRefCount;               	"Number of supporting mRNAs"
    string[mrnaRefCount] mrnaRefs;  	"Ids of mrnas supporting this"
    int[mrnaRefCount] mrnaTissues;  	"Ids of tissues that mrnas come from, indexes 
					 into tissue table"
    int[mrnaRefCount] mrnaLibs;     	"Ids of libraries that mrnas come from, indexes 
					 into library table"
   )

Atlas Oncology Site

table atlasOncoGene
"Table used to link into ATLAS Oncology site"
   (
   string locusSymbol;  "LocusLink Symbol"
   string atlasGene;    "ATLAS Gene"
   string otherGene;    "Other gene"
   string url;          "URL for corresonding ATLAS web page"
   )

Bactig Positions

table bactigPos
"Bactig positions in chromosome coordinates (bed 4 +)."
    (
    string chrom;        "Chromosome"
    uint   chromStart;   "Start position in chromosome"
    uint   chromEnd;     "End position in chromosome"
    string name;         "Bactig"
    string startContig;  "First contig in this bactig"
    string endContig;    "Last contig in this bactig"
    )

BGI (Beijing Genomics Institute) SNPs

table bgiSnp
"Beijing Genomics Institute SNP information as bed 4 +"
    (
    string chrom;       "Chromosome"
    uint   chromStart;  "Start position in chromosome"
    uint   chromEnd;    "End position in chromosome"
    string name;        "BGI SNP name: snp.superctg.ctg.pos.type.strainID"
    char[1] snpType;    "S (substitution), I (insertion), or D (deletion)"
    uint readStart;     "Start position in alternate allele read"
    uint readEnd;       "End position in alternate allele read"
    uint qualChr;       "Quality score in reference assembly"
    uint qualReads;     "Quality score in alternate allele read"
    string snpSeq;      "'X->Y' or indel sequence"
    string readName;    "Name of alternate allele read"
    char[1] readDir;    "Direction of read relative to reference"
    char[4] inBroiler;  "SNP found in Broiler strain? yes, no or n/a if not covered"
    char[4] inLayer;    "SNP found in Layer strain? yes, no or n/a if not covered"
    char[4] inSilkie;   "SNP found in Silkie strain? yes, no or n/a if not covered"
    string primerL;     "Left primer sequence"
    string primerR;     "Right primer sequence"
    char[1] questionM;  "L for dubious indels, H for other indels and SNPs"
    string extra;       "Additional information"
    )

Bio Cyc Pathway Map

table bioCycMapDesc
"Decription of BioCyc pathway maps"
   (
   string mapID;        "BioCyc pathway map"
   string description;  "BioCyc pathway map description"
   )

Bio Cyc Pathway

table bioCycPathway
"BioCyc Pathway to Known Gene cross reference"
   (
   string kgID;         "Known Gene ID" 
   string geneID;       "Gene (RefSeq) ID"
   string mapID;        "BioCyc pathway map ID"
   )

Blast Output

The following definition is used for several tables that contain blast for various organisms.
table blastTab
"Tab-delimited blast output file"
    (
    string query;	"Name of query sequence"
    string target;	"Name of target sequence"
    float identity;	"Percent identity"
    uint aliLength;	"Length of alignment"
    uint mismatch;	"Number of mismatches"
    uint gapOpen;	"Number of gap openings"
    uint qStart;	"Start in query (0 based)"
    uint qEnd;		"End in query (non-inclusive)"
    uint tStart;	"Start in target (0 based)"
    uint tEnd;		"End in target (non-inclusive)"
    double eValue;	"Expectation value"
    double bitScore;	"Bit score"
    )

Browser Extensible Data (BED)

The following definition is used for several tables, which may use 4 or more of the fields.
table bed
"Browser extensible data"
    (
    string chrom;      "Chromosome or FPC contig"
    uint   chromStart; "Start position in chromosome"
    uint   chromEnd;   "End position in chromosome"
    string name;       "Name of item"
    uint   score;      "Score from 0-1000"
    char[1] strand;    "+ or -"
    uint thickStart;   "Start of where display should be thick (start codon)"
    uint thickEnd;     "End of where display should be thick (stop codon)"
    uint reserved;     "Always zero for now"
    int blockCount;    "Number of blocks (a block contains no gaps)"
    int[blockCount] blockSizes; "Comma-separated list of block sizes"
    int[blockCount] blockStarts; "Start positions relative to chromStart."
    int expCount;      "Number of experiments."
    int[expCount] expIds; "Experiment IDs."
    int[expCount] expScores; "Experiment scores."
    )

Celera Coverage

table celeraCoverage
"Summary of large genomic Duplications from Celera Data"
    (
    string chrom;   	"Chromosome or FPC contig"
    uint chromStart;    "Start position in chromosome"
    uint chromEnd;      "End position in chromosome"
    string name; 	"Source of information"
    )

Celera Duplications

table celeraDupPositive
"Summary of large genomic Duplications from Celera Data"
    (
    string chrom;   	"Chromosome or FPC contig"
    uint chromStart;    "Start position in chromosome"
    uint chromEnd;  	"End position in chromosome"
    string name;    	"Celera accession name"
    string fullname;    "Celera accession full name"
    float fracMatch;    "Fraction of matching bases"
    float bpalign;  	"Base pair alignment score"
    )

Celera Overlay

table celeraOverlay
"Celera assembly overlay in the public (NCBI) assembly"
    (
    string chrom;      	"Chromosome"
    uint chromStart;   	"Start position in chromosome"
    uint chromEnd;     	"End position in chromosome"
    string name;       	"Other chromosome and start"
    string otherChrom; 	"Other chromosome"
    uint otherStart;   	"Start in other chromosome"
    uint otherEnd;     	"End in other chromosome"
    char orient;       	"Strand/orientation (F or R)"
    char field1;       	"? (always M)"
    char field2;       	"? (r or u)"
    string localId;    	"?"
    string parentInfo; 	"?"
    string refId;      	"Reference ID"
    uint refStart;     	"Start position in reference"
    uint refLength;     "Length in reference"
    byte refOrient;   	"Strand/orientation in reference (-1 or 1)"
    string queryId;    	"Query ID"
    uint queryStart;    "Start position in query"
    uint queryLength;   "Length in query"
    byte queryOrient; 	"Strand/orientation in query (-1 or 1)"
    )

CGAP Alias

table cgapAlias
"Associates CGAP pathway IDs with gene symbols or mRNA accessions"
   (
   string cgapID;       "CGAP pathway ID"
   string alias;        "Gene symbol or mRNA"
   )

CGAP/BioCarta Pathway Descriptions

table cgapBiocDesc
"CGAP/BioCarta pathway description"
  (
  string mapID;         "CGAP/BioCarta pathway map ID"
  string description;   "Description"
  )

CGAP/BioCarta Pathway Cross-Reference

table cgapBiocPathway
"CGAP BioCarta pathway cross reference"
   (
   string cgapID;       "CGAP pathway ID"
   string mapID;        "BioCarta pathway ID"
   )

Chain Alignments

table chain
"Summary info about a chain of alignments"
   (
   double score; 	"Score of chain"
   string tName; 	"Target sequence name"
   uint tSize; 		"Target sequence size"
   uint tStart; 	"Alignment start position in target"
   uint tEnd; 		"Alignment end position in target"
   string qName; 	"Query sequence name"
   uint qSize; 		"Query sequence size"
   char qStrand; 	"Query strand"
   uint qStart; 	"Alignment start position in query"
   uint qEnd; 		"Alignment end position in query"
   uint id; 		"Chain ID"
   }

Chain Link

table chainLink
"Alignment block in chain"
   (
   string tName; 	"Target sequence name"
   uint tStart; 	"Alignment start position in target"
   uint tEnd; 		"Alignment end position in target"
   uint qStart; 	"Start in query"
   uint chainId; 	"Chain ID in chain table"
   )

Chimp/Human Simple Differences

table chimpSimpleDiff
"Simple differences between the chimp and human genome assemblies"
   (
   string chrom;        "Chromosome number"
   uint   chromStart;   "Start position in chromosome"
   uint   chromEnd;     "End position in chromosome"
   string   tseq;	"Target sequence (A,T,C,G)"
   string   qseq;	"Query sequence (A,T,C,G)"
   )

Chromosome Band Information

table cytoBand
"Describes the positions of cytogenetic bands within a chromosome"
   (
   string chrom;        "Chromosome number"
   uint   chromStart;   "Start position in genoSeq"
   uint   chromEnd;     "End position in genoSeq"
   string name;         "Name of cytogenetic band"
   string gieStain;     "Giesma stain results"
   )

Chromosome Information

table chromInfo
"Chromosome names and sizes"
    (
    string chrom;       "Chromosome name"
    uint   size;        "Chromosome size"
    string fileName;    "Chromosome file (raw one byte per base)"
    )

Clone Fragment Positions

table chrN_gl
"Fragment positions in golden path"
    (
    ushort  bin;          "A field to speed indexing added to many 
                           tables August 2001"
    uint    matches;      "Number of bases that match that aren't repeats"
    uint    misMatches;   "Number of bases that don't match"
    string  frag;         "Fragment name"
    uint    start;        "Start position in golden path"
    uint    end;          "End position in golden path"
    char[1] strand;       "+ or - for strand"
    )

Clone Information

table clonePos
"A clone's position and other info."
    (
    string  name;       "Name of clone including version"
    uint    seqSize;    "base count not including gaps"
    ubyte   phase;      "htg phase"
    string  chrom;      "Chromosome name"
    uint    chromStart; "Start in chromosome"
    uint    chromEnd;   "End in chromosome"
    char[1] stage;      "F/D/P for finished/draft/predraft"
    string  faFile;     "File with sequence."
    )

Contamination Information

table contamination
"Contamination in assembly (bed 4 +)"
    (
    string chrom;      "Chromosome"
    uint   chromStart; "Start position in chromosome"
    uint   chromEnd;   "End position in chromosome"
    string name;       "Name (type of contamination: E.coli, vector etc)"
    string acc;        "NCBI accession where contamination was found"
    string ctg;        "Alternate name (e.g. WGS contig name)"
    int    len;        "Length of contaminated portion of sequence"
    )

Contig/Accession Map

table contigAcc
"Maps a contig to its accession."
    (
    string contig;      "Contig name"
    string acc;         "GenBank accession"
    )

Contig Information

table ctgPos
"Where a contig is inside of a chromosome."
    (
    string contig;      "Name of contig"
    uint   size;        "Size of contig"
    string chrom;       "Chromosome name"
    uint   chromStart;  "Start in chromosome"
    uint   chromEnd;    "End in chromosome"
    )

Contig Information (Expanded)

table ctgPos2
"Where a contig is inside of a chromosome including contig type information."
    (
    string contig;      "Name of contig"
    uint   size;        "Size of contig"
    string chrom;       "Chromosome name"
    uint   chromStart;  "Start in chromosome"
    uint   chromEnd;    "End in chromosome"
    char[1] type;       "(W)GS contig, (F)inished, (P)redraft, (D)raft, (O)ther"
    )

CpG Islands

table cpgIsland
"Describes CpG Islands of 50 bases or more that are at least 50% G/C"
   (
   string chrom;        "Chromosome or FPC contig"
   uint   chromStart;   "Start position in chromosome"
   uint   chromEnd;     "End position in chromosome"
   string name;         "CpG Island"
   uint   minLength;    "Minimum Island Length"
   uint   cpgNum;       "Number of CpGs in island"
   uint   gcNum;        "Number of C and G in island"
   uint   perCpg;       "Percentage of island that is CpG"
   uint   perGc;        "Percentage of island that is C or G"
   )

CpG Islands (Extended)

table cpgIslandExt
"Describes the CpG Islands (includes observed/expected ratio)"
   (
   string chrom;        "Chromosome or FPC contig"
   uint   chromStart;   "Start position in chromosome"
   uint   chromEnd;     "End position in chromosome"
   string name;         "CpG Island"
   uint   length;       "Island Length"
   uint   cpgNum;       "Number of CpGs in island"
   uint   gcNum;        "Number of C and G in island"
   uint   perCpg;       "Percentage of island that is CpG"
   uint   perGc;        "Percentage of island that is C or G"
   float  obsExp;	"Ratio of observed(cpgNum) to expected(numC*numG/length) 
			 CpG in island"
   )

dbSnpRS

table dbSnpRS 
"Information from dbSNP at the reference SNP level"
    (
    uint    rsId;    	"RS identifier"
    float   avHet;   	"Average heterozygosity from all observations"
    float   avHetSE; 	"Standard Error for the average heterozygosity from all 
			 observations"
    string  valid;   	"Validation status of the SNP"
    char[1] base1;   	"Base of the first allele"
    char[1] base2;   	"Base of the second allele"
    string  assembly; 	"Sequence in the ucsc assembly"
    string  alternate; 	"Sequence of the alternate allele"
    )

DNA Motif

table dnaMotif
"A gapless DNA motif"
    (
    string name;                        "Motif name"
    int columnCount;                    "Count of columns in motif"
    float[columnCount] aProb;           "Probability of A's in each column"
    float[columnCount] cProb;           "Probability of C's in each column"
    float[columnCount] gProb;           "Probability of G's in each column"
    float[columnCount] tProb;           "Probability of T's in each column"
    )

DupSpMrna

table dupSpMrna
"Duplicate mRNA/Protein entries that have identical CDS structures"
   (
   string mrnaID;        "mRNA ID name"
   string proteinID;     "Protein ID from SwissProt"
   string dupMrnaID;     "mRNA ID of duplicated entry"
   string dupProteinId;  "Protein ID of duplicated entry"
   )

EncodeRegionInfo

table encodeRegionInfo
"Descriptive, assembly-independent information about ENCODE regions"
    (
    string name;        "Name of region"
    string descr;       "Description (gene region, random pick, etc.)"
    )

Ensembl Gene XRef

table ensGtp
"Associates Ensembl gene, transcript, and protein IDs"
   (
   string gene; 	"Gene ID"
   string transcript; 	"Transcript ID"
   string protein; 	"Protein ID"
   )

EnsemblXref

table ensemblXref
"Cross-reference info between transcript entries and translation entries of Ensembl 
genes"  
   (
   string db;			"Ensembl Database"
   int gene_id;			"Gene ID number"
   string gene_name;		"Gene name in database"
   int transcript_id;		"Transcript ID number"
   string transcript_name;	"Transcript name in database"
   uint translation_id;		"Translation ID number"
   string translation_name; 	"Translation name in database"
   string external_db;		"External database"
   string external_name;	"External status"
   )

EnsemblXref2

table ensemblXref2
"Cross-reference info between transcript entries and translation entries of Ensembl 
genes (alternative table used in some assemblies)"  
   (
   string transcript_name; 	"Transcript name"
   string translation_name; 	"Translation name"
   )

EnsemblXref3

table ensemblXref3
"A cross-reference table for Ensembl Genes."
   (
   string gene;            "Ensembl gene ID"
   string geneVer;         "Ensembl gene ID version number"
   string transcript;      "Ensembl transcript ID"
   string transcriptVer;   "Ensembl transcript ID version number"
   string protein;         "Ensembl protein ID"
   string proteinVer;      "Ensembl protein version number"
   string tremblAcc;       "TrEMBL  protein accession number"
   string swissDisplayId;  "Swiss-Prot protein display ID"
   string swissAcc;        "Swiss-Prot protein accession number"
   )

Ensembl gene_xref Table

Ensemble gene_xref table downloaded directly from the Ensembl ftp site (18.34 release 11/04). Refer to the Ensembl site for details. CAUTION: Ensembl sometimes changes its table definitions and some fields may not contain the data that the name indicates, e.g. translation_name.
table ensGeneXref
"Table gene_xref downloaded from Ensembl"
   (
   string db;                   "#"
   string analysis;             "#"
   string type;                 "#"
   int gene_id;                 "#"
   string gene_name;            "#"
   int[5] gene_version;         "#"
   int transcript_id;           "#"
   string transcript_name;      "#"
   int[5] transcript_version;   "#"
   int[5] translation_name;     "#"
   int translation_id;          "#"
   int[5] translation_version;  "#"
   string external_db;          "#"
   string external_name;        "#"
   char[10] external_status;    "#"
   )

Ensembl transcript Table

Ensembl transcript table downloaded directly from the Ensembl ftp site (18.34 release 11/04). Refer to the Ensembl site for details. CAUTION: Ensembl sometimes changes its table definitions and some fields may not contain the data that the name indicates.
table ensTranscript
"Table transcript downloaded from Ensembl"
   (
   int id;                      "#"
   string db;                   "#"
   string analysis;             "#"
   string type;                 "#"
   int transcript_id;           "#"
   string transcript_name;      "#"
   int transcript_version;      "#"
   string chr_name;             "#"
   int chr_start;               "#"
   int chr_end;                 "#"
   int chr_strand;              "#"
   int coding_start;            "#"
   int coding_end;              "#"
   int translation_id;          "#"
   string translation_name;     "#"
   string translation_version;  "#"
   int gene_id;                 "#"
   string gene_name;            "#"
   int gene_version;            "#"
   lstring exon_structure;      "#"
   lstring exon_ids;            "#"
   string external_db;          "#"
   string external_name;        "#"
   char[10] external_status;    "#"
   )

Exofish Information

table exoFish
"An evolutionarily conserved region (ecore) with Tetroadon"
   (
   string chrom;        "Chromosome or FPC contig"
   uint   chromStart;   "Start position in chromosome"
   uint   chromEnd;     "End position in chromosome"
   string name;         "Just a dummy dot"
   uint   score;        "Score from 0 to 1000"
   )

Exonerate Mouse Homologies

Last used Apr. 2001.
table exoMouse
"A rough alignment - not detailed"
   (
   string chrom;        "Chromosome or FPC contig"
   uint   chromStart;   "Start position in chromosome"
   uint   chromEnd;     "End position in chromosome"
   string name;         "Name of other sequence"
   uint   score;        "Score from 0 to 1000"
   char[1] strand;      " or -"
   uint otherStart;     "Start in other sequence"
   uint otherEnd;       "End in other sequence"
   )

Experiment Record

table expRecord
"Minimal descriptive data for an experiment in the browser"
   (
   uint id; 			"Internal id of experiment"
   string name; 		"Name of experiment"
   lstring description; 	"Description of experiment"
   lstring url; 		"URL relevant to experiment"
   lstring ref; 		"Reference for experiment"
   lstring credit; 		"Who to credit with experiment"
   uint numExtras; 		"Number of extra things"
   lstring[numExtras] extras; 	"Extra things of interest, i.e. classifications"
   )

Expression Data

table expData
"Expression data (no mapping, just spots)"
   (
   string name; 		"Name of gene/target/probe etc."
   uint expCount; 		"Number of scores"
   float[expCount] expScores; 	"Scores. May be absolute or relative ratio"
   )

Expression Distance Between 2 Genes

table expDistance
"Distance between two genes in expression space"
    (
    string query;       "Name of one gene"
    string target;      "Name of other gene"
    float distance;     "Distance in expression space, always >= 0"
    )

FISH Clones

table fishClones
"Describes the positions of fishClones in the assembly"
   (
   string chrom;                   "Chromosome number"
   uint   chromStart;              "Start position in chromosome"
   uint   chromEnd;                "End position in chromosome"
   string name;                    "Name of clone"
   uint score;                     "Always 1000"
   uint placeCount;                "Number of times FISH'd"
   string[placeCount] bandStarts;  "Start FISH band"
   string[placeCount] bandEnds;    "End FISH band"
   string[placeCount] labs;        "Lab where clone FISH'd"
   string placeType;               "How clone was placed on the sequence assembly"
   uint accCount;                  "Number of accessions associated with the clone"
   string[accCount] accNames;      "Accession associated with clone"
   uint stsCount;                  "Number of STS markers associated with this clone"
   string[stsCount] stsNames;      "Names of STS  markers"
   uint beCount;                   "Number of BAC end sequences associated with this 
				    clone"
   string[beCount] beNames;        "Accessions of BAC ends"
   )

FlyBase Accessions Linked to SwissProt Accessions

table flyBaseSwissProt
"FlyBase acc to SwissProt acc, plus some other SwissProt info"
    (
    string flyBaseId;	"FlyBase FBgn ID"
    string swissProtId;	"SwissProt ID"
    string spGeneName;	"Gene name from SwissProt (long)"
    string spSymbol;	"Symbolic-looking gene ID from SwissProt"
    )

FlyBase Alleles

table fbAllele
"The alleles of a gene"
    (
    int id;		"Allele ID"
    string geneId;	"FlyBase ID of gene"
    string name;	"Allele name"
    )

FlyBase Gene Synonyms

table fbSynonym
"Links all the names we call a gene to its FlyBase ID"
    (
    string geneId;	"FlyBase ID"
    string name;	"A name (synonym or real)"
    )

FlyBase Genes

table fbGene
"Links FlyBase IDs, gene symbols and gene names"
    (
    string geneId;	"FlyBase ID"
    string geneSym;	"Short gene symbol"
    string geneName;	"Gene name - up to a couple of words"
    )

FlyBase Genes Cross-Reference (2004)

table flyBase2004Xref
"FlyBase cross references circa late 2004 (dm2/4.0; dp3/1.0 uses a subset)"
    (
    string name;                "FlyBase annotation gene ID"
    string symbol;              "Symbolic gene name"
    lstring synonyms;           "Comma-separated list of synonyms"
    string fbtr;                "FlyBase FBtr acc"
    string fbgn;                "FlyBase FBgn acc"
    string fbpp;                "FlyBase FBpp acc"
    string fban;                "FlyBase FBan acc"
    string type;                "Annotation type (for noncoding only)"
    )

FlyBase Genes Linked to BDGP Transcript IDs

table fbTranscript
"Links FlyBase gene IDs and BDGP transcript IDs"
    (
    string geneId;	 "FlyBase gene ID"
    string transcriptId; "BDGP Transcript ID"
    )

FlyBase Genes Linked to GO IDs

table fbGo
"Links FlyBase gene IDs and GO IDs/aspects"
    (
    string geneId;      "FlyBase gene ID"
    string goId;        "GO ID"
    string aspect;      "P (process), F (function) or C (cellular component)"
    )

FlyBase Observed Phenotype in Mutant

table fbPhenotype
"Observed phenotype in mutant. Sometimes contains gene function info"
    (
    string geneId;	"FlyBase gene ID"
    int fbAllele;	"ID in fbAllele table or 0 if not allele-specific"
    int fbRef;		"ID in fbRef table"
    lstring text;	"Descriptive text"
    )

FlyBase Reference

table fbRef
"A literature or sometimes database reference"
    (
    int id;		"Reference ID"
    lstring text;	"Usually begins with FlyBase ref ID, but not always"
    )

FlyBase Role of Gene in Wildtype

table fbRole
"Role of gene in wildType"
    (
    string geneId;	"FlyBase gene ID"
    int fbAllele;	"ID in fbAllele table or 0 if not allele-specific"
    int fbRef;		"ID in fbRef table"
    lstring text;	"Descriptive text"
    )

FlyReg Data

table flyreg
"Flyreg data from Casey Bergman"
    (
    string chrom;      "Chromosome"
    uint   chromStart; "Start position in chromosome"
    uint   chromEnd;   "End position in chromosome"
    string name;       "Factor"
    string target;     "Target"
    uint   pmid;       "PubMed ID"
    )

FlyReg Data (version 2)

table flyreg
"Flyreg data (version 2) from Casey Bergman"
    (
    string chrom;      "Chromosome"
    uint   chromStart; "Start position in chromosome"
    uint   chromEnd;   "End position in chromosome"
    string name;       "Factor"
    string target;     "Target"
    uint   pmid;       "PubMed ID"
    uint   fpid;       "Footprint ID -- stable ID across versions"
    )

Fosmid End Pairs

table fosEndPairs
"Positions of end pairs for fosmids"
   (
   short  bin;        	      "Bin number for browser speedup"
   string chrom;      	      "Chromosome"
   uint   chromStart; 	      "Start position of fosmid in chromosome"
   uint   chromEnd;   	      "End position of fosmid in chromosome"
   string name;       	      "Name of fosmid"
   uint   score;      	      "Score = 1000/(# of times fosmid appears in assembly)"
   char[1] strand;    	      "Value should be + or -"
   string pslTable;   	      "Table which contains corresponding PSL records for linked 
			       features"
   uint   lfCount;    	      "Number of linked features in the series"
   uint[lfCount]   lfStarts;  "Comma-separated list of start positions of each linked 
				feature in genomic"
   uint[lfCount]   lfSizes;   "Comma-separated list of sizes of each linked feature 
				in genomic"
   string[lfCount] lfNames;   "Comma-separated list of names of linked features"
   )

Gap Positions

table chrN_gap
"Gaps in golden path"
    (
    ushort bin;         "A field to speed indexing added to many 
                         tables August 2001"
    uint   matches;     "Number of bases that match that aren't repeats"
    uint   misMatches;  "Number of bases that don't match"
    string chrom;       "which chromosome"
    uint   chromStart;  "start position in chromosome"
    uint   chromEnd;    "end position in chromosome"
    int    ix;          "ix of this fragment (useless)"
    char[1] n;          "always 'N'"
    uint   size;        "size of gap"
    string type;        "contig, clone, fragment, etc."
    string bridge;      "yes, no, mrna, bacEndPair, etc."
    )

GC Content

table gcPercent
"Displays GC content in 20Kb non-overlapping blocks for genome"
   (
   string chrom;        "Chromosome number"
   uint   chromStart;   "Start position in chromosome"
   uint   chromEnd;     "End position in chromosome"
   string name;         "Constant string GC"
   uint   gcPpt;        "Average number of G and C per 1000 bases"
   )                                                                            

GenBank Protein Annotations

table gbProtAnn
"Protein Annotations from GenPept mat_peptide fields"
    (
    string chrom;      	"Chromosome"
    uint   chromStart; 	"Start position in chromosome"
    uint   chromEnd;   	"End position in chromosome"
    string name;       	"Name of item"
    string product;    	"Protein product name"
    string note;       	"Note (may be empty)"
    string proteinId; 	"GenBank protein accession(.version)"
    uint giId;        	"GenBank db_xref number"
    )

Gene Bands

table geneBands
"Band locations of known genes"
   (
   string name;         "Gene name (HUGO Gene Nomenclature Committee symbol, if possible)"
   string mrnaAcc;      "RefSeq mRNA accession"
   int    count;        "Number of times this accession maps to the genome"
   string[count] bands; "List of chromosome bands to which it maps"
   )                                                                            

Gene Boundaries as Defined by RNA and Spliced EST Clusters

table rnaCluster
"Gene boundaries deduced from clustering spliced ESTs and mRNAs against the genome"
    (
    smallint bin;           "Index"
    string chrom;           "Chromosome or FPC contig"
    uint   chromStart;      "Start position in chromosome"
    uint   chromEnd;        "End position in chromosome"
    string name;            "Name of item"
    uint   score;           "Score from 0-1000"
    char[1] strand;         "+ or -"
    uint thickStart;        "Start of where display should be thick (start codon)"
    uint thickEnd;          "End of where display should be thick (stop codon)"
    uint reserved;          "Always zero for now"
    int blockCount;         "Number of blocks (a block contains no gaps)"
    longblob blockSizes;    "Size of each block"
    longblob chromStarts;   "Start position of each block in chromosome"
    ) 

Gene Predictions and RefSeq Genes

The following definition is used for gene prediction tables. In alternative-splicing situations, each transcript has a row in this table.
table genePred
"A gene prediction."
    (
    string  name;               "Name of gene"
    string  chrom;