Examine multiple variation parameters for a genomic region , Biology

Assignment Help:
  1. Determine SNP variation among the aligned DNAs for a genomic region.   See below for how to count SNP variation.  The output file (Your_name_snp.txt) should have two columns of numbers.  The first column will indicate total number of SNP sites per species and the second will be the percent of sequences/species having that same number of variant nucleotides.
  2. Determine in-del variation among the aligned DNAs for a genomic region. The output file (Your_name_in_del.txt) should be two columns of numbers.  The first column will indicate total number of in-del sites per species and the second will be the percent of sequences/species having that same number of in-del.
  3. Determine overall variation (SNPs and in-dels) among the aligned DNAs for a genomic region. The output file (Your_name_both.txt) two columns of numbers.  The first column will indicate total number of variant sites (SNP and in-del) per species and the second will be the percent of sequences/species having that same number of variant nucleotides.  This will generate the same data used for the figure on page 3.

Sample Alignment: 48 bases,  differences are highlighted

Seq1      ATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq2      AAAAATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq3      AAAAATGCATGCATGCA-GCATGCATGCATGCATGCATGCATGCATGC

Seq4      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCATGCATGCATGC

Seq5      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCAT-CATGCATGC

Computation:  Compare Seq1 to 2,3,4, and 5 you find the differences (SNPs and InDels).

Seq1:Seq1 = 0 changes

Seq1:Seq2 = 3 changes

Seq1:Seq3 = 4 changes

Seq1:Seq4 = 7 changes

Seq1:Seq5 = 8 changes

 Repeat using each of the other sequences as the basis for comparison

Seq2:Seq1 = 3 changes                  Seq3:Seq1 = 4 changes

Seq2:Seq2 = 0 changes                  Seq3:Seq2 = 1 changes

Seq2:Seq3 = 1 changes                  Seq3:Seq3 = 0 changes

Seq2:Seq4 = 4 changes                  Seq3:Seq4 = 3 changes

Seq2:Seq5 = 5 changes                  Seq3:Seq5 = 4 changes

 

Seq4:Seq1 = 7 changes                  Seq5:Seq1 = 8 changes

Seq4:Seq2 = 4 changes                  Seq5:Seq2 = 5 changes

Seq4:Seq3 = 3 changes                  Seq5:Seq3 = 4 changes

Seq4:Seq4 = 0 changes                  Seq5:Seq4 = 1 changes

Seq4:Seq5 = 1 changes                  Seq5:Seq5 = 0 changes

 

Our input file is a FASTA format file of all sequences/species that has been previously aligned and trimmed.  There are some odd characters in the file, so we'll have to deal with that.


Related Discussions:- Examine multiple variation parameters for a genomic region

What is suppuration, Suppuration High number of PMN cells have been det...

Suppuration High number of PMN cells have been detected around implants that are associated with severe signs of mucosal inflammation. Several studies have shown the presence o

Consists of the production of atp, Select all that are true/correct: Cellul...

Select all that are true/correct: Cellular respiration consists of the production of ATP in several steps using enzymes. Photosynthesis produces carbohydrates using carbon dioxide

Explain cultivation of microorganisms, Explain Cultivation of microorganism...

Explain Cultivation of microorganisms It is for  the production of food colourants has attractions, these must be measured against the financial legislative and user constraint

Explain dose-scaling, Dose-scaling Toxicological equivalent doses in a...

Dose-scaling Toxicological equivalent doses in animals and humans are a debatable issue. The Joint FAOIWHO Expert Committee on Food Additives (JECFA) and Joint FA01 WHO Meetin

What is the name of the dna duplication process, What is the name of the DN...

What is the name of the DNA duplication process? What is the main enzyme that participates in it? The process of copying, or duplication, of the DNA molecule is called replica

Chlamydiosis-clinical manifestations, Clinical manifestations Since the...

Clinical manifestations Since the disease due to Chlamydia involves many organs/systems, system-wise descriptions of clinical signs is described as under: Genital infectio

Which are the brain regions associated with memory, Which are the brain reg...

Which are the brain regions associated with memory? According to researchers some of the main regions of the nervous system associated with the memory phenomenon are the hippoc

Explain about the endocrine system, Why is the endocrine system considered ...

Why is the endocrine system considered one of the integrative systems of the body? What is the other physiological system that also has this function? The endocrine system is s

Pyruvate carboxylase activation, Oxaloacetate has two main roles. It is an ...

Oxaloacetate has two main roles. It is an intermediate which is consumed in gluconeogenesis and it is also a key intermediate in the citric acid cycle where it fuses with acetyl Co

Define the basic function of keratin in the epidermis, Q. What is the funct...

Q. What is the function of keratin in the epidermis? The epidermis is the outer layer of the skin made of epithelial tissue in the epidermis there are keratin-secreting cells (

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd