GAP Lecture 2
GAP Lecture 2
Cell contains only a single set of genome: Usually circular or contigous, linear
DNA.
1
• Mordern molecular biology, the genome of an organism refers to the
hereditary information encoded in a DNA.
• HANS WINKER : Genome includes both the gene and the non-coding
sequences of the DNA.
2
GENOMICS
• Refers to the study of global properties of genome of related organisms.
• No: of genes and no: of basepairs vary widely from one sps to another.
3
Types of Genomes
• Organisms more complex than viruses carry additional genetic material
besides that resides in the chromosomes.
• Genome also includes all the genes and information on non-coding DNA that
have the potential to be present.
4
Genomes and genetic variation
• Genome does not include the genetic diversity as the genetic polymorphism of
a species.
• (eg) The human genome sequence in principle could be determined from just
half the information on the DNA of one cell from one individual.
5
Genome sequencing projects
• Scientific endeavours that ultimately aim to determine the complete genome
sequence of the organism(animal, plant, fungus, bacteria, archea, virus etc).
• Genome sequence of any organism requires the DNA sequences for each of
the chromosomes in an organism to be determined.
6
Genome Assembly
• Process of taking large no: of short DNA sequences : generated by shotgun
sequencing projects.
7
• Tandem repeats : assembly not easy :
• Scaffolds are positioned along the physical map of the chromosome creating a
golden path.
8
Assembly Software
• 1) Phred/Phrep by Phil green : 1st successful assemblers used in 1990’s and
early 2000’s especially for smaller genomes.
9
Genome Annotation
• Process by which you attach biological information to sequences.
10
When is the genome project complete ?
• Does not involve just confining to determining the DNA sequence of an
organism.
• 1st genome project : phage x174 with 5368 bp by Fred Sanger in 1977.
11
Genome Evolution
• Genomic traits can be measured and studied without reference to any
particular genes and their products.
• Predict what mechanisms could have produced the variety of genomes that
exist today.
12
Bacterial Genome
• Genome size shows a strong positive relationship with gene number.
( Contains little non-coding region).
• Genome sizes do not vary anywhere near as much as thos eof animals(order of
20 fold to 7000 fold).
• Pattern explanation:
argument: some bacteris have small genomes : selection for rapid cell
division:
Larger DNA content take longer to replicate : slows down cell cycle.
13
• Mira et al (2001) compared doubling time and genome size in bacteria in the
lab : found no significant relationship between them.
BUT
14
Genome Map
• Helps navigate around the genome
• Could be short DNA sequences and regulatory sites that turn genes on or off.
15
Difference between a genome map and genome sequence
• Sequence: Spells out the order of every DNA base in the genome.
• (eg) Possible to determine the location of the gene to map a gene without
sequencing it.
• Map may tell nothing about a sequence and a sequence nothing about a map.
16
• GCCATTGACGTCCCCTTGAAA
CGGTAACTGCATGGGAACTTT
• GCC…………….CCCC………….
………………CAGGG………AAA
• In general, in humans and other sps with large genomes, creating a reasonably
comprehensive genome map is quicker and cheaper than sequencing the entire genome.
• Mapping therefore, involves less information to collect and organise than sequencing.
• A human genome study aims at both comprehensive genome map and complete genome
sequence.
17
Why Map a genome when one can sequence?
• Redundant, yet a map can help sequence a genome.
• (eg) Genomes sequenced with clone by clone method, a map is required to determine
where the clone belongs in the genome.
• The more accurate and detailed the map, the easier it is to put the gigsaw in place.
• One can still use maps though to assemble sequence to the proper place in the genome.
• Especially in humans: Long string of letters difficult to inderstand and identify genes and
other interesting features from junk.
• Landmark provides clues regarding important parts of the genome sequence and
location.
18
Use of Genome Maps
• Helps in finding genes especially those involved in human diseases.
• Essentially a hot and cold approach wherein many families affected by the
disease are studied.
• The inheritance of the disease and various landmarks are traced through
several generations.
• Landmarks that tend to be inherited along with the disease are likely to be
located close to the disease gene an become markers for that gene.
• Thus, search reduced from entire 3 billion bp to a region with just a few
million base pairs long.
19
• Next step:
• Is to look for genes in that part of the genome and study one by one and
identify the one involved in the disease.
• (eg) compare affected people with healthy ones to look for a gene with a
function related to the disease.
• Genes for cystic fibrosis, Hugtingtons disease and other inherited diseases
have been identified by this method.
20
• In future, detailed genome mapping would help scientists identify genes faster
and study complex human diseases like cancer, heart disease, personaliy traits
etc.
• Clones ( chunks of DNA that have been chopped and spliced into DNA of
bacterial cell).
21