Examine multiple variation parameters for a genomic region , Biology

Assignment Help:
  1. Determine SNP variation among the aligned DNAs for a genomic region.   See below for how to count SNP variation.  The output file (Your_name_snp.txt) should have two columns of numbers.  The first column will indicate total number of SNP sites per species and the second will be the percent of sequences/species having that same number of variant nucleotides.
  2. Determine in-del variation among the aligned DNAs for a genomic region. The output file (Your_name_in_del.txt) should be two columns of numbers.  The first column will indicate total number of in-del sites per species and the second will be the percent of sequences/species having that same number of in-del.
  3. Determine overall variation (SNPs and in-dels) among the aligned DNAs for a genomic region. The output file (Your_name_both.txt) two columns of numbers.  The first column will indicate total number of variant sites (SNP and in-del) per species and the second will be the percent of sequences/species having that same number of variant nucleotides.  This will generate the same data used for the figure on page 3.

Sample Alignment: 48 bases,  differences are highlighted

Seq1      ATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq2      AAAAATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq3      AAAAATGCATGCATGCA-GCATGCATGCATGCATGCATGCATGCATGC

Seq4      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCATGCATGCATGC

Seq5      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCAT-CATGCATGC

Computation:  Compare Seq1 to 2,3,4, and 5 you find the differences (SNPs and InDels).

Seq1:Seq1 = 0 changes

Seq1:Seq2 = 3 changes

Seq1:Seq3 = 4 changes

Seq1:Seq4 = 7 changes

Seq1:Seq5 = 8 changes

 Repeat using each of the other sequences as the basis for comparison

Seq2:Seq1 = 3 changes                  Seq3:Seq1 = 4 changes

Seq2:Seq2 = 0 changes                  Seq3:Seq2 = 1 changes

Seq2:Seq3 = 1 changes                  Seq3:Seq3 = 0 changes

Seq2:Seq4 = 4 changes                  Seq3:Seq4 = 3 changes

Seq2:Seq5 = 5 changes                  Seq3:Seq5 = 4 changes

 

Seq4:Seq1 = 7 changes                  Seq5:Seq1 = 8 changes

Seq4:Seq2 = 4 changes                  Seq5:Seq2 = 5 changes

Seq4:Seq3 = 3 changes                  Seq5:Seq3 = 4 changes

Seq4:Seq4 = 0 changes                  Seq5:Seq4 = 1 changes

Seq4:Seq5 = 1 changes                  Seq5:Seq5 = 0 changes

 

Our input file is a FASTA format file of all sequences/species that has been previously aligned and trimmed.  There are some odd characters in the file, so we'll have to deal with that.


Related Discussions:- Examine multiple variation parameters for a genomic region

Explain the systemic antibiotic cover, Explain the systemic antibiotic cove...

Explain the systemic antibiotic cover The need for  systemic antibiotic cover should be considered. The original protocols recommended an antibiotic such as amoxicillin 250 mg

Define sterol regulatory element binding proteins (srebps), Define Sterol R...

Define Sterol Regulatory Element Binding Proteins (SREBPS)? These belong to a general family of transcription factors. They are synthesized as membrane embedded proteins in res

Circulation - initiation of cpr, Circulation: If central pulses (femor...

Circulation: If central pulses (femoral  in infants and  carotid in  children) are not  palpable, begin chest compression without losing any time.  In  young infants, encircle

Discuss the genetics involved in the inheritance, Casein is the main protei...

Casein is the main protein in milk. There are three types of casein, one of which is beta casein. Beta casein comes in two main variants called A1 and A2. The order of amino acids

Explain metastatic carcinoma, Explain Metastatic carcinoma Metastatic ...

Explain Metastatic carcinoma Metastatic carcinoma:- Cancer that can be  transferred from one part of the body to other unrelated parts.

What are the main risk factors for hypertension, Q. What are the main risk ...

Q. What are the main risk factors for hypertension? The major risks factors for hypertension are tobacco smoking, stress, obesity, sedentary lifestyle and alcoholism.

Diagnosis of diabetes mellitus, Q. Diagnosis of diabetes mellitus? Time...

Q. Diagnosis of diabetes mellitus? Timely and proper diagnosis plays a key role not only in identifying new cases but also in managing old cases with or without diabetic compli

What are mineral salts, What are mineral salts? Where in living beings can ...

What are mineral salts? Where in living beings can mineral salts are found? Mineral salts are simple inorganic substances made of metallic chemical elements, such as iron, sodi

Explain protostomes vs. deuterostomes breifly, Explain Protostomes vs. Deut...

Explain Protostomes vs. Deuterostomes breifly? Branching Evolutionary Lines : It appears that the Coelomate animals separated into two divergent lines of evolution in terms o

What is patient''s plasma osmolarity after the infusion, Tina administered ...

Tina administered 1 liter of sterile distilled water IV to a patient. Predict the direction (increase, decrease, no change) you would expect Tina's infusion to have produced in the

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd