Examine multiple variation parameters for a genomic region , Biology

Assignment Help:
  1. Determine SNP variation among the aligned DNAs for a genomic region.   See below for how to count SNP variation.  The output file (Your_name_snp.txt) should have two columns of numbers.  The first column will indicate total number of SNP sites per species and the second will be the percent of sequences/species having that same number of variant nucleotides.
  2. Determine in-del variation among the aligned DNAs for a genomic region. The output file (Your_name_in_del.txt) should be two columns of numbers.  The first column will indicate total number of in-del sites per species and the second will be the percent of sequences/species having that same number of in-del.
  3. Determine overall variation (SNPs and in-dels) among the aligned DNAs for a genomic region. The output file (Your_name_both.txt) two columns of numbers.  The first column will indicate total number of variant sites (SNP and in-del) per species and the second will be the percent of sequences/species having that same number of variant nucleotides.  This will generate the same data used for the figure on page 3.

Sample Alignment: 48 bases,  differences are highlighted

Seq1      ATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq2      AAAAATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq3      AAAAATGCATGCATGCA-GCATGCATGCATGCATGCATGCATGCATGC

Seq4      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCATGCATGCATGC

Seq5      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCAT-CATGCATGC

Computation:  Compare Seq1 to 2,3,4, and 5 you find the differences (SNPs and InDels).

Seq1:Seq1 = 0 changes

Seq1:Seq2 = 3 changes

Seq1:Seq3 = 4 changes

Seq1:Seq4 = 7 changes

Seq1:Seq5 = 8 changes

 Repeat using each of the other sequences as the basis for comparison

Seq2:Seq1 = 3 changes                  Seq3:Seq1 = 4 changes

Seq2:Seq2 = 0 changes                  Seq3:Seq2 = 1 changes

Seq2:Seq3 = 1 changes                  Seq3:Seq3 = 0 changes

Seq2:Seq4 = 4 changes                  Seq3:Seq4 = 3 changes

Seq2:Seq5 = 5 changes                  Seq3:Seq5 = 4 changes

 

Seq4:Seq1 = 7 changes                  Seq5:Seq1 = 8 changes

Seq4:Seq2 = 4 changes                  Seq5:Seq2 = 5 changes

Seq4:Seq3 = 3 changes                  Seq5:Seq3 = 4 changes

Seq4:Seq4 = 0 changes                  Seq5:Seq4 = 1 changes

Seq4:Seq5 = 1 changes                  Seq5:Seq5 = 0 changes

 

Our input file is a FASTA format file of all sequences/species that has been previously aligned and trimmed.  There are some odd characters in the file, so we'll have to deal with that.


Related Discussions:- Examine multiple variation parameters for a genomic region

Skeletal system - girdles, GIRDLES - (i ) PECTORAL GIRDLE - ...

GIRDLES - (i ) PECTORAL GIRDLE - 4 bones. It is located on posterolateral part of upper region of the throax. It consists of scapula & clavicle. Scapula is placed

Spermatogenesis, Spermatogenesis: In the mature male functional sper...

Spermatogenesis: In the mature male functional sperm cells are produced within the seminiferous tubules of the testes. Around the periphery of the seminiferous tubules are l

Metanephridia - excretion, Normal 0 false false false E...

Normal 0 false false false EN-IN X-NONE X-NONE MicrosoftInternetExplorer4

Renal function & cardiovascular - change related with ageing, Define Renal ...

Define Renal Function & cardiovascular - change related with ageing? Changes associated with the cardiovascular and renal function: The progressive accumulation of athermanous

What is the typical biological function of the tissues, What is the typical...

What is the typical biological function of the connective tissues? How is this function associated to the main features of its cells? The typical function of the connective tis

What are integral membrane proteins, Membrane proteins are classify as eith...

Membrane proteins are classify as either integral (intrinsic) or peripheral (extrinsic) depending on how tightly they are linked with membrane. The Integral membrane proteins are s

Design a microcontroller-based system, Suppose you are in charge of designi...

Suppose you are in charge of designing a microcontroller-based system for a packet processing application. These are high speed internet packets from the PC that are going to be pr

Eletrons., Atom has how many eletrons

Atom has how many eletrons

Describe the origin of muscle cramps and pains, Q. How can the knowledge ab...

Q. How can the knowledge about fermentation describe the origin of muscle cramps and pains after intense physical exertion? A typical fermentation process due to oxygen scarcit

What is mdrtb, What is MDRTB MDRTB - For patients with known exposure t...

What is MDRTB MDRTB - For patients with known exposure to multi-drug resistant TB (MDRTB) and a high risk of developing active TB, there are no data-based recommendations. Regi

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd