Examine multiple variation parameters for a genomic region , Biology

Assignment Help:
  1. Determine SNP variation among the aligned DNAs for a genomic region.   See below for how to count SNP variation.  The output file (Your_name_snp.txt) should have two columns of numbers.  The first column will indicate total number of SNP sites per species and the second will be the percent of sequences/species having that same number of variant nucleotides.
  2. Determine in-del variation among the aligned DNAs for a genomic region. The output file (Your_name_in_del.txt) should be two columns of numbers.  The first column will indicate total number of in-del sites per species and the second will be the percent of sequences/species having that same number of in-del.
  3. Determine overall variation (SNPs and in-dels) among the aligned DNAs for a genomic region. The output file (Your_name_both.txt) two columns of numbers.  The first column will indicate total number of variant sites (SNP and in-del) per species and the second will be the percent of sequences/species having that same number of variant nucleotides.  This will generate the same data used for the figure on page 3.

Sample Alignment: 48 bases,  differences are highlighted

Seq1      ATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq2      AAAAATGCATGCATGCATGCATGCATGCATGCATGCATGCATGCATGC

Seq3      AAAAATGCATGCATGCA-GCATGCATGCATGCATGCATGCATGCATGC

Seq4      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCATGCATGCATGC

Seq5      AAAAATGCATGCATGCA-GCATGCATGCATTTTTGCAT-CATGCATGC

Computation:  Compare Seq1 to 2,3,4, and 5 you find the differences (SNPs and InDels).

Seq1:Seq1 = 0 changes

Seq1:Seq2 = 3 changes

Seq1:Seq3 = 4 changes

Seq1:Seq4 = 7 changes

Seq1:Seq5 = 8 changes

 Repeat using each of the other sequences as the basis for comparison

Seq2:Seq1 = 3 changes                  Seq3:Seq1 = 4 changes

Seq2:Seq2 = 0 changes                  Seq3:Seq2 = 1 changes

Seq2:Seq3 = 1 changes                  Seq3:Seq3 = 0 changes

Seq2:Seq4 = 4 changes                  Seq3:Seq4 = 3 changes

Seq2:Seq5 = 5 changes                  Seq3:Seq5 = 4 changes

 

Seq4:Seq1 = 7 changes                  Seq5:Seq1 = 8 changes

Seq4:Seq2 = 4 changes                  Seq5:Seq2 = 5 changes

Seq4:Seq3 = 3 changes                  Seq5:Seq3 = 4 changes

Seq4:Seq4 = 0 changes                  Seq5:Seq4 = 1 changes

Seq4:Seq5 = 1 changes                  Seq5:Seq5 = 0 changes

 

Our input file is a FASTA format file of all sequences/species that has been previously aligned and trimmed.  There are some odd characters in the file, so we'll have to deal with that.


Related Discussions:- Examine multiple variation parameters for a genomic region

How can you describe the homeostasis, Q. What is the homeostasis? What are ...

Q. What is the homeostasis? What are the sensors, effectors and controllers of homeostasis? Homeostasis comprises the processes by which the organism maintains extracellular an

Define risk management, Define Risk management Risk management is def...

Define Risk management Risk management is defined  for the purposes of the Codex Alimentations Commission as  "the process, distinct from  risk assessment,  of weighing polic

Prior to the solving of the structure of dna, Prior to the solving of the s...

Prior to the solving of the structure of DNA, Erwin Chargaff calculated the amounts of the four nucleotides (A, T,G,C) and made which of the following conclusions? A. All four

Define excretion of iron in human body, Define Excretion of Iron in Human B...

Define Excretion of Iron in Human Body? Our body has a limited capacity to excrete iron once it has been absorbed. Daily losses in adult man are between 0.9 to 1.05 mg. About 0

Which of the following structures in a vertebrate, Which of the following s...

Which of the following structures in a vertebrate with a four-chambered heart would have blood with the highest oxygen concentration? And why? A. Arteriole end of a capillary B. Ri

What is genetic mapping, How can the concept of recombination frequency be ...

How can the concept of recombination frequency be used in genetic mapping? Genetic mapping is the determination of the location of the genes in a chromosome. By determining

What do you mean by neurotransmitters, Q. What do you mean by neurotransmit...

Q. What do you mean by neurotransmitters? - Nicotinic receptors (nicotine mimics the effects of Ach here). Found at NM-junction, ANS ganglions in general. Binding of Ach to

Mechanism of phloem transport, Mechanism of Phloem Transport The effic...

Mechanism of Phloem Transport The efficiency and magnitude of translocation of food material are evident from the annual yields of various crops and fruits. Now, the question

Explain the rapport building, Q. Explain the Rapport Building? Maintain...

Q. Explain the Rapport Building? Maintaining good relationship with the client is very important in counselling. This will help you to gain trust, confidence of the client and

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd