Examine multiple variation parameters for a genomic region , Biology

Assignment Help:
  1. Determine SNP variation among the aligned DNAs for a genomic region.   See below for how to count SNP variation.  The output file (Your_name_snp.txt) should have two columns of numbers.  The first column will indicate total number of SNP sites per species and the second will be the percent of sequences/species having that same number of variant nucleotides.
  2. Determine in-del variation among the aligned DNAs for a genomic region. The output file (Your_name_in_del.txt) should be two columns of numbers.  The first column will indicate total number of in-del sites per species and the second will be the percent of sequences/species having that same number of in-del.
  3. Determine overall variation (SNPs and in-dels) among the aligned DNAs for a genomic region. The output file (Your_name_both.txt) two columns of numbers.  The first column will indicate total number of variant sites (SNP and in-del) per species and the second will be the percent of sequences/species having that same number of variant nucleotides.  This will generate the same data used for the figure on page 3.

Sample Alignment: 48 bases,  differences are highlighted

Seq1      ATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq2      AAAAATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq3      AAAAATGCATGCATGCA-GCATGCATGCATGCATGCATGCATGCATGC

Seq4      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCATGCATGCATGC

Seq5      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCAT-CATGCATGC

Computation:  Compare Seq1 to 2,3,4, and 5 you find the differences (SNPs and InDels).

Seq1:Seq1 = 0 changes

Seq1:Seq2 = 3 changes

Seq1:Seq3 = 4 changes

Seq1:Seq4 = 7 changes

Seq1:Seq5 = 8 changes

 Repeat using each of the other sequences as the basis for comparison

Seq2:Seq1 = 3 changes                  Seq3:Seq1 = 4 changes

Seq2:Seq2 = 0 changes                  Seq3:Seq2 = 1 changes

Seq2:Seq3 = 1 changes                  Seq3:Seq3 = 0 changes

Seq2:Seq4 = 4 changes                  Seq3:Seq4 = 3 changes

Seq2:Seq5 = 5 changes                  Seq3:Seq5 = 4 changes

 

Seq4:Seq1 = 7 changes                  Seq5:Seq1 = 8 changes

Seq4:Seq2 = 4 changes                  Seq5:Seq2 = 5 changes

Seq4:Seq3 = 3 changes                  Seq5:Seq3 = 4 changes

Seq4:Seq4 = 0 changes                  Seq5:Seq4 = 1 changes

Seq4:Seq5 = 1 changes                  Seq5:Seq5 = 0 changes

 

Our input file is a FASTA format file of all sequences/species that has been previously aligned and trimmed.  There are some odd characters in the file, so we'll have to deal with that.


Related Discussions:- Examine multiple variation parameters for a genomic region

Biological collections, Why do scientists make biological collections? Wha...

Why do scientists make biological collections? What are the scientific reasons for doing a collection? How is a collection done?

What is the importance of water for enzymatic activity, What is the importa...

What is the importance of water for enzymatic activity? Enzymes, biological catalysts, rely on water to reach their substrates and bind to them. There is no enzymatic activity

Hylonema., i hyalonema labled diagram with his genral characters and c...

i hyalonema labled diagram with his genral characters and classification

Peristaltic movements of the intestines, Q. Which is the kind of muscle tis...

Q. Which is the kind of muscle tissue that performs the peristaltic movements of the intestines? The smooth muscle tissue is responsible for the peristaltic movements of the in

Pericardium, The heart is enclosed in a membranous sac called the pericardi...

The heart is enclosed in a membranous sac called the pericardium. It has two layers- the fibrous pericardium which is the outer layer and the serous pericardium that lies inside th

Deficiency diseases-downer cow syndrome, Downer cow syndrome This syndr...

Downer cow syndrome This syndrome is a common sequel to milk fever. The term is commonly applied to those hypocalcaemic cows that remain recumbent for several hours before bein

What is consanguineal marriage, What is consanguineal marriage? Why is the ...

What is consanguineal marriage? Why is the appearing of genetic disease more probable in the offspring of a consanguineal marriage? The Consanguineal marriage is the marriage b

State the bio-medical waste management, Bio-medical waste management Pe...

Bio-medical waste management Persons coming in contact with bio-medical waste are prone to get injury from sharps likes needles. Injury due to sharps leads to life threatening

Quaternary structure of protein, Quaternary Structure (4 o Structure)....

Quaternary Structure (4 o Structure). Globular in structure. When two or more than two molecules of protein of tertiary structure are connected to each other through

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd