Examine multiple variation parameters for a genomic region , Biology

Assignment Help:
  1. Determine SNP variation among the aligned DNAs for a genomic region.   See below for how to count SNP variation.  The output file (Your_name_snp.txt) should have two columns of numbers.  The first column will indicate total number of SNP sites per species and the second will be the percent of sequences/species having that same number of variant nucleotides.
  2. Determine in-del variation among the aligned DNAs for a genomic region. The output file (Your_name_in_del.txt) should be two columns of numbers.  The first column will indicate total number of in-del sites per species and the second will be the percent of sequences/species having that same number of in-del.
  3. Determine overall variation (SNPs and in-dels) among the aligned DNAs for a genomic region. The output file (Your_name_both.txt) two columns of numbers.  The first column will indicate total number of variant sites (SNP and in-del) per species and the second will be the percent of sequences/species having that same number of variant nucleotides.  This will generate the same data used for the figure on page 3.

Sample Alignment: 48 bases,  differences are highlighted

Seq1      ATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq2      AAAAATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq3      AAAAATGCATGCATGCA-GCATGCATGCATGCATGCATGCATGCATGC

Seq4      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCATGCATGCATGC

Seq5      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCAT-CATGCATGC

Computation:  Compare Seq1 to 2,3,4, and 5 you find the differences (SNPs and InDels).

Seq1:Seq1 = 0 changes

Seq1:Seq2 = 3 changes

Seq1:Seq3 = 4 changes

Seq1:Seq4 = 7 changes

Seq1:Seq5 = 8 changes

 Repeat using each of the other sequences as the basis for comparison

Seq2:Seq1 = 3 changes                  Seq3:Seq1 = 4 changes

Seq2:Seq2 = 0 changes                  Seq3:Seq2 = 1 changes

Seq2:Seq3 = 1 changes                  Seq3:Seq3 = 0 changes

Seq2:Seq4 = 4 changes                  Seq3:Seq4 = 3 changes

Seq2:Seq5 = 5 changes                  Seq3:Seq5 = 4 changes

 

Seq4:Seq1 = 7 changes                  Seq5:Seq1 = 8 changes

Seq4:Seq2 = 4 changes                  Seq5:Seq2 = 5 changes

Seq4:Seq3 = 3 changes                  Seq5:Seq3 = 4 changes

Seq4:Seq4 = 0 changes                  Seq5:Seq4 = 1 changes

Seq4:Seq5 = 1 changes                  Seq5:Seq5 = 0 changes

 

Our input file is a FASTA format file of all sequences/species that has been previously aligned and trimmed.  There are some odd characters in the file, so we'll have to deal with that.


Related Discussions:- Examine multiple variation parameters for a genomic region

Water loss - plant water relation, Water Loss - Plant Water Relation ...

Water Loss - Plant Water Relation Plants lose about 98% of water to the atmosphere by transpiration. Often water loss by transpiration exceeds gain by absorption and results

Evolution of man, EVOLUTION OF MAN - Mammals evolved from primitive...

EVOLUTION OF MAN - Mammals evolved from primitive reptiles (therapsida) in Triassic period, about 210 million years ago. Mammals existed as inconspicuous group of small

What is an antigen, Q. What is an antigen? Antigen is any substance, in...

Q. What is an antigen? Antigen is any substance, infectious or particle agent recognized as foreign to the body. The contact of the antigen with the body promotes a defense rea

Which compounds results from cyclization of glucose, Which of the following...

Which of the following compounds results from cyclization of glucose? Select one: a. alpha-D-glucopyronose b. beta-D-glucopyronose c. alpha-D-glucofuranose d. beta-D

Laws governing ocular movements, Laws Governing Ocular Movements Herin...

Laws Governing Ocular Movements Hering's Law of Equal Innervation states that equal and simultaneous innervation flows from the brain to yoke muscles in all binocular movement

Show major complications of hypertension, Q. Show Major complications of hy...

Q. Show Major complications of hypertension? High blood pressure is one of the leading causes of kidney failure, also commonly called end-stage renal disease (ESRD).  Major

Explain anticonvulsants and cerebral palsy, Anticonvulsants and Cerebral Pa...

Anticonvulsants and Cerebral Palsy Anticonvulsants : drugs that prevent reduce or stop convulsions or seizures. Cerebral Palsy: A group of chronic foundations affecting

Determine about the term - heterogeneous mixture, Determine about the term ...

Determine about the term - heterogeneous mixture The solid phase made up of soil and other source material for nutrients, is a heterogeneous mixture. Each nutrient source contr

Explain some limitations of most probable number, Explain some Limitations ...

Explain some Limitations of Most Probable Number? 1. Large quantity of glassware is required. 2. Colony morphology cannot be determined. 3. It lacks precision.

Development of psychiatric nursing in other countries, DEVELOPMENT OF PSYCH...

DEVELOPMENT OF PSYCHIATRIC NURSING IN OTHER COUNTRIES: Although the scientific findings rejected the belief that mental disturbance was the devil's work, the earliest asylums

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd