Examine multiple variation parameters for a genomic region , Biology

Assignment Help:
  1. Determine SNP variation among the aligned DNAs for a genomic region.   See below for how to count SNP variation.  The output file (Your_name_snp.txt) should have two columns of numbers.  The first column will indicate total number of SNP sites per species and the second will be the percent of sequences/species having that same number of variant nucleotides.
  2. Determine in-del variation among the aligned DNAs for a genomic region. The output file (Your_name_in_del.txt) should be two columns of numbers.  The first column will indicate total number of in-del sites per species and the second will be the percent of sequences/species having that same number of in-del.
  3. Determine overall variation (SNPs and in-dels) among the aligned DNAs for a genomic region. The output file (Your_name_both.txt) two columns of numbers.  The first column will indicate total number of variant sites (SNP and in-del) per species and the second will be the percent of sequences/species having that same number of variant nucleotides.  This will generate the same data used for the figure on page 3.

Sample Alignment: 48 bases,  differences are highlighted

Seq1      ATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq2      AAAAATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq3      AAAAATGCATGCATGCA-GCATGCATGCATGCATGCATGCATGCATGC

Seq4      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCATGCATGCATGC

Seq5      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCAT-CATGCATGC

Computation:  Compare Seq1 to 2,3,4, and 5 you find the differences (SNPs and InDels).

Seq1:Seq1 = 0 changes

Seq1:Seq2 = 3 changes

Seq1:Seq3 = 4 changes

Seq1:Seq4 = 7 changes

Seq1:Seq5 = 8 changes

 Repeat using each of the other sequences as the basis for comparison

Seq2:Seq1 = 3 changes                  Seq3:Seq1 = 4 changes

Seq2:Seq2 = 0 changes                  Seq3:Seq2 = 1 changes

Seq2:Seq3 = 1 changes                  Seq3:Seq3 = 0 changes

Seq2:Seq4 = 4 changes                  Seq3:Seq4 = 3 changes

Seq2:Seq5 = 5 changes                  Seq3:Seq5 = 4 changes

 

Seq4:Seq1 = 7 changes                  Seq5:Seq1 = 8 changes

Seq4:Seq2 = 4 changes                  Seq5:Seq2 = 5 changes

Seq4:Seq3 = 3 changes                  Seq5:Seq3 = 4 changes

Seq4:Seq4 = 0 changes                  Seq5:Seq4 = 1 changes

Seq4:Seq5 = 1 changes                  Seq5:Seq5 = 0 changes

 

Our input file is a FASTA format file of all sequences/species that has been previously aligned and trimmed.  There are some odd characters in the file, so we'll have to deal with that.


Related Discussions:- Examine multiple variation parameters for a genomic region

Which molecule should be most radioactive, Yeast is cultured in the presenc...

Yeast is cultured in the presence of radioactive phosphate and the following biological molecules are purified. Which molecule should be most radioactive? A. An oligosaccharide B.

What are the hormones secreted by the adrenal medulla, What are the hormone...

What are the hormones secreted by the adrenal medulla? What are their respective functions? The medullary portion of the adrenals secretes hormones of the catecholamine group:

Response to heavy metal stress, Response to Heavy Metal Stress Severa...

Response to Heavy Metal Stress Several heavy metals emanating from industrial mining and sewage disposal operations contaminate the environment. Cadmium is a common contamina

How can a great biological diversity protect an ecosystem, How can a great ...

How can a great biological diversity protect an ecosystem from environmental damage? Why are less biodiverse ecosystems at risk of suffering deep biological harm if submitted to ev

Explain about the blanching - food processing, Explain about the Blanching ...

Explain about the Blanching - Food Processing? Blanching is used for variety of purposes. It is defined as a mild heat treatment applied to tissue (usually plant) prior to free

Explain oleic - linoleic fats, Oleic - Linoleic Fats Fats in this grou...

Oleic - Linoleic Fats Fats in this group are the most abundant. The oils are all of vegetable origin and contain large amounts of oleic and linoleic acids, and less than 20%

Sedative hypnotics - psychological drug dependence, SED A TIV E HYPNOTIC...

SED A TIV E HYPNOTICS - They depress the activity of CNS. Reduce excitement, give feeling of calm. Higher doses induces sleep. Sleep inducing drugs are also called

Explain the axonal terminal portion, What is an example of a situation in w...

What is an example of a situation in which the neuron cell body is located in a part of the body and its axonal terminal portion is in another distant part of the body? Why does th

Explains the difference among dominant and recessive alleles, Which of the ...

Which of the following best explains the difference among dominant and recessive alleles? A. The recessive allele encodes a protein with normal activity whereas the dominant al

Basic principles of planning diets for diabetes, Q. Basic principles of pla...

Q. Basic principles of planning diets for diabetes? The above objectives can be met by adhering to some of the basic principles of planning diets which include the consideratio

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd