Gene names and symbols

Caution! Gene naming is very complex. Consult a specialist!

Gene names usually describe the function of the gene. Use lower case, apart from proper nouns and acronyms:

haemoglobin gene     alpha 1 gene

Conventions for gene symbols vary between organisms. International nomenclature committees for specific organisms set guidelines, including approved gene names, symbols, case and type style.

Bacteria

Bacterial genes are named using a 3-letter designation, usually an abbreviation for the pathway or for the phenotype of mutants. The 3-letter designation is written in lower case and italics. Different genes that affect the same pathway are distinguished by a capital letter following the 3-letter designation (without a space). An allele number can also be added (also without a space) to designate a particular mutation:

lac     lacZ     lacZ19
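
The structure described above can be checked mechanically. The following is a minimal sketch in Python, assuming the simple pattern of 3 lower-case letters, an optional capital letter and an optional allele number; the function name and dictionary keys are hypothetical, and italics are a typesetting matter that the check does not cover:

```python
import re

# Assumed pattern: 3 lower-case letters, then optionally a capital letter,
# then optionally an allele number (no spaces anywhere).
BACTERIAL_GENE = re.compile(r"([a-z]{3})(?:([A-Z])(\d+)?)?")


def parse_bacterial_gene(designation):
    """Split a designation such as 'lacZ19' into its parts.

    Returns None if the text does not fit the assumed pattern.
    """
    match = BACTERIAL_GENE.fullmatch(designation)
    if match is None:
        return None
    pathway, gene_letter, allele = match.groups()
    return {"pathway": pathway, "gene_letter": gene_letter, "allele": allele}


print(parse_bacterial_gene("lacZ19"))
print(parse_bacterial_gene("lac Z"))  # contains a space, so it is rejected
```

Real designations can be more varied than this pattern allows, so a failed match is a prompt to check the relevant guidelines rather than proof of an error.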

It is important to distinguish the phenotype of a bacterial strain from its genotype. The phenotype is usually indicated with the same 3-letter designation as the genotype, but phenotypes begin with a capital letter and are not italicised. A superscript ‘+’ designates the wild-type phenotype and a superscript ‘–’ designates the mutant phenotype:

Lac⁺     Cys⁻

Other designations can also be added using superscripts, but must be defined:

Strʳ [streptomycin resistance]

Plants

Plant gene nomenclature follows the same basic guidelines as animal gene nomenclature.

Animals, including humans

A gene symbol should be no more than 6 characters, and most guidelines now call for 3-letter symbols. Most guidelines specify capital letters and italics, but there are a number of exceptions (eg mice and rats). Symbols should start with the first letter of, and reflect, the gene name. Greek letters and roman numerals should be avoided (arabic numerals are acceptable), as should commas, hyphens and superscripted or subscripted characters:

HBA1 [hemoglobin, alpha 1 gene]

Human gene symbols follow these general rules and are italicised:

CREBBP [CREB binding protein gene]

HTT [Huntington disease gene]

Mouse and rat gene names follow the same basic rules as for other species, but gene symbols have a slightly different format. The main difference is that mouse and rat gene symbols have only the initial letter capitalised. Hyphens, superscripts and subscripts are also allowed when referring to alleles or pseudogenes:

Tlr2 [toll-like receptor 2 gene]

Hba-ps3 [hemoglobin alpha pseudogene 3]
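
The difference in case between human symbols and mouse and rat symbols can be illustrated with a short Python sketch. The function name and organism labels are hypothetical, and the sketch deals only with letter case: it does not apply italics, and it does not imply that a symbol approved for one species is approved for another.

```python
def format_symbol_case(symbol, organism):
    """Apply the case convention for a gene symbol.

    Assumes 'human' symbols are all capitals and 'mouse' or 'rat'
    symbols have only the initial letter capitalised.
    """
    if organism == "human":
        return symbol.upper()
    if organism in ("mouse", "rat"):
        return symbol[:1].upper() + symbol[1:].lower()
    raise ValueError(f"no case convention defined here for {organism!r}")


print(format_symbol_case("crebbp", "human"))   # CREBBP
print(format_symbol_case("TLR2", "mouse"))     # Tlr2
print(format_symbol_case("HBA-PS3", "mouse"))  # Hba-ps3
```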

For flies, the gene name and symbol are in sentence case if the gene is named after the protein or if the gene was first named for a mutant phenotype that is dominant to the wild-type phenotype. A gene name and symbol start with a lower-case letter if the gene was first named for a mutant phenotype that is recessive to the wild-type phenotype. Gene symbols are italicised:

Actin 5C [gene name]     Act5C [gene symbol]

will die slowly [gene name]     wds [gene symbol]
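
The rule for the initial letter of a fly gene name or symbol amounts to a simple decision, sketched here in Python. The function and parameter names are hypothetical; whether a gene was named after its protein, or for a dominant or recessive mutant phenotype, must come from the literature or the relevant database.

```python
def fly_initial_case(text, named_after_protein=False, mutant_is_dominant=False):
    """Set the case of the first letter of a fly gene name or symbol.

    Upper case if the gene is named after the protein or for a dominant
    mutant phenotype; otherwise (recessive mutant phenotype) lower case.
    The rest of the text is left unchanged.
    """
    first, rest = text[:1], text[1:]
    if named_after_protein or mutant_is_dominant:
        return first.upper() + rest
    return first.lower() + rest


print(fly_initial_case("actin 5C", named_after_protein=True))  # Actin 5C
print(fly_initial_case("Will die slowly"))                     # will die slowly
print(fly_initial_case("Wds"))                                 # wds
```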

Zebrafish gene names are in lower case and italicised. The gene symbols are also in lower case and italicised, and use 3 or more letters:

engrailed 1a [gene name]     eng1a [gene symbol]

If there is a need to distinguish between species (ie for homologues), include the species in brackets after the gene name:

LFNG (Drosophila) [lunatic fringe homologue of Drosophila]
