Examine multiple variation parameters for a genomic region , Biology

Assignment Help:
  1. Determine SNP variation among the aligned DNAs for a genomic region.   See below for how to count SNP variation.  The output file (Your_name_snp.txt) should have two columns of numbers.  The first column will indicate total number of SNP sites per species and the second will be the percent of sequences/species having that same number of variant nucleotides.
  2. Determine in-del variation among the aligned DNAs for a genomic region. The output file (Your_name_in_del.txt) should be two columns of numbers.  The first column will indicate total number of in-del sites per species and the second will be the percent of sequences/species having that same number of in-del.
  3. Determine overall variation (SNPs and in-dels) among the aligned DNAs for a genomic region. The output file (Your_name_both.txt) two columns of numbers.  The first column will indicate total number of variant sites (SNP and in-del) per species and the second will be the percent of sequences/species having that same number of variant nucleotides.  This will generate the same data used for the figure on page 3.

Sample Alignment: 48 bases,  differences are highlighted

Seq1      ATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq2      AAAAATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq3      AAAAATGCATGCATGCA-GCATGCATGCATGCATGCATGCATGCATGC

Seq4      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCATGCATGCATGC

Seq5      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCAT-CATGCATGC

Computation:  Compare Seq1 to 2,3,4, and 5 you find the differences (SNPs and InDels).

Seq1:Seq1 = 0 changes

Seq1:Seq2 = 3 changes

Seq1:Seq3 = 4 changes

Seq1:Seq4 = 7 changes

Seq1:Seq5 = 8 changes

 Repeat using each of the other sequences as the basis for comparison

Seq2:Seq1 = 3 changes                  Seq3:Seq1 = 4 changes

Seq2:Seq2 = 0 changes                  Seq3:Seq2 = 1 changes

Seq2:Seq3 = 1 changes                  Seq3:Seq3 = 0 changes

Seq2:Seq4 = 4 changes                  Seq3:Seq4 = 3 changes

Seq2:Seq5 = 5 changes                  Seq3:Seq5 = 4 changes

 

Seq4:Seq1 = 7 changes                  Seq5:Seq1 = 8 changes

Seq4:Seq2 = 4 changes                  Seq5:Seq2 = 5 changes

Seq4:Seq3 = 3 changes                  Seq5:Seq3 = 4 changes

Seq4:Seq4 = 0 changes                  Seq5:Seq4 = 1 changes

Seq4:Seq5 = 1 changes                  Seq5:Seq5 = 0 changes

 

Our input file is a FASTA format file of all sequences/species that has been previously aligned and trimmed.  There are some odd characters in the file, so we'll have to deal with that.


Related Discussions:- Examine multiple variation parameters for a genomic region

Illustrate the composition of agar, Composition of agar Agar we learnt ...

Composition of agar Agar we learnt consists of a mixture of agrose and agropectin.  Agrose has a linear polymer structure  consisting of alternating D-galactose and 3,6-anhydro

Explain the swab method, Explain the Swab Method? Swab method is the ol...

Explain the Swab Method? Swab method is the oldest and widely used method in food and dairy industry and was developed by W.A. Manheimer and T. Ybanez in 1917. A sterile cotton

Explain nutritional management of metabolic diseases, Q. Explain nutritiona...

Q. Explain nutritional management of metabolic diseases? The nutritional management of metabolic diseases such as gout and a few inborn errors of metabolism such as phenylketon

What are the causes of obesity - etiology, What are the causes of obesity -...

What are the causes of obesity - Etiology? However simple the question may sound, the answer to it is not all that simple. We cannot deny that excess weight results from positi

Gonial apospory and somatic apospory, Gonial Apospory and Somatic Apospory ...

Gonial Apospory and Somatic Apospory Gonial Apospory There is no meiosis. The megaspore mother cell enlarges; a small vacuole appears above and below the nucleus. The c

Define nutrient needs of a lactating mother, Define nutrient needs of a lac...

Define nutrient needs of a lactating mother? Energy and Protein Needs: Remember that during pregnancy, well-nourished women will have laid down approximately 2-4 kg of fat. Thi

Proteins, Proteins Proteins are continually synthesised in the cells a...

Proteins Proteins are continually synthesised in the cells as they are the principal component required for growth. Proteins are composed of amino acids which are derived larg

Define intermediary metabolites of an athletes, Define Intermediary Metabol...

Define Intermediary Metabolites of an Athletes? Coenzyme Q10 - for the physically active; Co-Q 10 activates cell energy. While you carry out any physical act; run, jump, throw,

Functional morphology of reproductive organs, Normal 0 false ...

Normal 0 false false false EN-IN X-NONE X-NONE MicrosoftInternetExplorer4

Morphological nature of endosperm, Morphological Nature of Endosperm ...

Morphological Nature of Endosperm The morphological nature of endosperm in angiosperms has been a subject of much discussion in evolution. The endosperm in gymnosperms is a g

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd