Examine multiple variation parameters for a genomic region , Biology

Assignment Help:
  1. Determine SNP variation among the aligned DNAs for a genomic region.   See below for how to count SNP variation.  The output file (Your_name_snp.txt) should have two columns of numbers.  The first column will indicate total number of SNP sites per species and the second will be the percent of sequences/species having that same number of variant nucleotides.
  2. Determine in-del variation among the aligned DNAs for a genomic region. The output file (Your_name_in_del.txt) should be two columns of numbers.  The first column will indicate total number of in-del sites per species and the second will be the percent of sequences/species having that same number of in-del.
  3. Determine overall variation (SNPs and in-dels) among the aligned DNAs for a genomic region. The output file (Your_name_both.txt) two columns of numbers.  The first column will indicate total number of variant sites (SNP and in-del) per species and the second will be the percent of sequences/species having that same number of variant nucleotides.  This will generate the same data used for the figure on page 3.

Sample Alignment: 48 bases,  differences are highlighted

Seq1      ATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq2      AAAAATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq3      AAAAATGCATGCATGCA-GCATGCATGCATGCATGCATGCATGCATGC

Seq4      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCATGCATGCATGC

Seq5      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCAT-CATGCATGC

Computation:  Compare Seq1 to 2,3,4, and 5 you find the differences (SNPs and InDels).

Seq1:Seq1 = 0 changes

Seq1:Seq2 = 3 changes

Seq1:Seq3 = 4 changes

Seq1:Seq4 = 7 changes

Seq1:Seq5 = 8 changes

 Repeat using each of the other sequences as the basis for comparison

Seq2:Seq1 = 3 changes                  Seq3:Seq1 = 4 changes

Seq2:Seq2 = 0 changes                  Seq3:Seq2 = 1 changes

Seq2:Seq3 = 1 changes                  Seq3:Seq3 = 0 changes

Seq2:Seq4 = 4 changes                  Seq3:Seq4 = 3 changes

Seq2:Seq5 = 5 changes                  Seq3:Seq5 = 4 changes

 

Seq4:Seq1 = 7 changes                  Seq5:Seq1 = 8 changes

Seq4:Seq2 = 4 changes                  Seq5:Seq2 = 5 changes

Seq4:Seq3 = 3 changes                  Seq5:Seq3 = 4 changes

Seq4:Seq4 = 0 changes                  Seq5:Seq4 = 1 changes

Seq4:Seq5 = 1 changes                  Seq5:Seq5 = 0 changes

 

Our input file is a FASTA format file of all sequences/species that has been previously aligned and trimmed.  There are some odd characters in the file, so we'll have to deal with that.


Related Discussions:- Examine multiple variation parameters for a genomic region

Respiration, lesson plan practical requirements

lesson plan practical requirements

Explain a review of eating disorder, Explain a review of Eating Disorder? ...

Explain a review of Eating Disorder? If you talk to a group of young boys and girls in an informal setting about their physical appearance, you will find that a majority of the

Volvox Colonial Existence, Necessity of interdependence of Volvox organisms...

Necessity of interdependence of Volvox organisms in the colonial existence

Syngamy - protozoan, Syngamy - Protozoan The two gametes may be morpho...

Syngamy - Protozoan The two gametes may be morphologically similar (isogametes) or dissimilar (anisogametes). The gametes also vary in form, they may be flagellated or amoeboi

Genetics , Genetics Genetics is the study of heredity. It is an ancien...

Genetics Genetics is the study of heredity. It is an ancient discipline. At least 4000 years ago, in Sumeria, Egypt and other parts of the world, farmers recognized that they

Define about the sporocyst larva, Define about the Sporocyst Larva? Sp...

Define about the Sporocyst Larva? Sporocyst is the second larval stage in the life cycle of F. hepatica. It develops from the miracidium larva within the pulmonary chamber of

Show water permeability of the luminal membranes, Healthy Person H takes a ...

Healthy Person H takes a new drug named ANTICAMPCOLLDUCT that blocks the production of cyclic AMP (cAMP) in collecting duct epithelial cells in response to vasopressin binding to V

Low protein products, Foods with moderate amounts of protein can be eaten i...

Foods with moderate amounts of protein can be eaten in limited amounts. These foods include grains, bread, pasta, rice, potatoes, corn, and peas. Foods with little or no protein ar

What is the classification of burns, What is the Classification of Burns? ...

What is the Classification of Burns? Burns can be classified on the basis of the extent, depth, patient age and associated illness or injury. On the basis of depth, burns are u

What do you mean by recovery period, Q. What do you mean by Recovery Period...

Q. What do you mean by Recovery Period? At the instant exercise is discontinued, the ECG recorded is turned on and left running for a few seconds while the blood pressure is re

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd