Examine multiple variation parameters for a genomic region , Biology

Assignment Help:
  1. Determine SNP variation among the aligned DNAs for a genomic region.   See below for how to count SNP variation.  The output file (Your_name_snp.txt) should have two columns of numbers.  The first column will indicate total number of SNP sites per species and the second will be the percent of sequences/species having that same number of variant nucleotides.
  2. Determine in-del variation among the aligned DNAs for a genomic region. The output file (Your_name_in_del.txt) should be two columns of numbers.  The first column will indicate total number of in-del sites per species and the second will be the percent of sequences/species having that same number of in-del.
  3. Determine overall variation (SNPs and in-dels) among the aligned DNAs for a genomic region. The output file (Your_name_both.txt) two columns of numbers.  The first column will indicate total number of variant sites (SNP and in-del) per species and the second will be the percent of sequences/species having that same number of variant nucleotides.  This will generate the same data used for the figure on page 3.

Sample Alignment: 48 bases,  differences are highlighted

Seq1      ATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq2      AAAAATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq3      AAAAATGCATGCATGCA-GCATGCATGCATGCATGCATGCATGCATGC

Seq4      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCATGCATGCATGC

Seq5      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCAT-CATGCATGC

Computation:  Compare Seq1 to 2,3,4, and 5 you find the differences (SNPs and InDels).

Seq1:Seq1 = 0 changes

Seq1:Seq2 = 3 changes

Seq1:Seq3 = 4 changes

Seq1:Seq4 = 7 changes

Seq1:Seq5 = 8 changes

 Repeat using each of the other sequences as the basis for comparison

Seq2:Seq1 = 3 changes                  Seq3:Seq1 = 4 changes

Seq2:Seq2 = 0 changes                  Seq3:Seq2 = 1 changes

Seq2:Seq3 = 1 changes                  Seq3:Seq3 = 0 changes

Seq2:Seq4 = 4 changes                  Seq3:Seq4 = 3 changes

Seq2:Seq5 = 5 changes                  Seq3:Seq5 = 4 changes

 

Seq4:Seq1 = 7 changes                  Seq5:Seq1 = 8 changes

Seq4:Seq2 = 4 changes                  Seq5:Seq2 = 5 changes

Seq4:Seq3 = 3 changes                  Seq5:Seq3 = 4 changes

Seq4:Seq4 = 0 changes                  Seq5:Seq4 = 1 changes

Seq4:Seq5 = 1 changes                  Seq5:Seq5 = 0 changes

 

Our input file is a FASTA format file of all sequences/species that has been previously aligned and trimmed.  There are some odd characters in the file, so we'll have to deal with that.


Related Discussions:- Examine multiple variation parameters for a genomic region

Define briefly about the nutrition security, Define Briefly about the Nutri...

Define Briefly about the Nutrition security? Nutrition security can be briefly defined as a balance between biological requirements in energy and nutrients and the quantity and

What are the benefits of plant-incorporated protectants, What are the benef...

What are the benefits of plant-incorporated protectants? Plant-incorporated protectants can be a promising pest management alternative where traditional pesticides may not be a

Coelenterata, give outline classification of coelenterata

give outline classification of coelenterata

Botulism, B o tu l i s m It is a toxicity in chickens, turkeys, d...

B o tu l i s m It is a toxicity in chickens, turkeys, ducks and other aquatic birds caused by a bacterial toxin produced by anaerobic bacteria Clostridium botulinum mai

Find out the electric potential, Find the electric potential, taking zero a...

Find the electric potential, taking zero at infinity, at the upper right corner (the corner without a charge) of the rectangle in the figure. (Let y = 3.7 cm and x = 6.7 cm.)

Explain nevirapine and its adverse effects, Explain Nevirapine and its adve...

Explain Nevirapine and its adverse effects Nevirapine (NVP,Viramune) - Nevirapine is most effective at raising CD4 cell counts and lowering viral load when combined with 2 NRTI

Discovery of plant hormones, Discovery of Plant Hormones To date, five...

Discovery of Plant Hormones To date, five major classes of plant hormones have been discovered namely auxins, gibberellins, cytokinins, abscisic acid and ethylene. It is possi

How many respectively are genotypical and phenotypical forms, In F2 generat...

In F2 generation of a hybridization for a given trait conditioned by a pair of alleles T and t, according to Mendel's first law what are the genotypes of each phenotypical form? An

Explain hormonal proteins, Explain Hormonal proteins Hormonal proteins ...

Explain Hormonal proteins Hormonal proteins coordinate the bodily activities. Various peptide and protein hormones (such as insulin and growth hormone) carry information tha

What were coopers views on nature, What were coopers views on nature (parti...

What were coopers views on nature (particularly in regard to human progress and resource use)? And Is there any thought about rights of nature or conservation of resources in her w

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd