Implement the chou-fasman algorithm

Assignment Help Biology
Reference no: EM13705784

The Chou-Fasman Method for Secondary Structure Prediction:

The basic idea is that each amino acid residue is assigned three numbers that describe its propensity to be part of α-helices, β-sheets and tight β-turns. A large number (above 100) for a particular structure means the amino acid has a high preference for that structure, e.g. α-helix, which is typically coupled to low propensity for the other structures. These Chou-Fasman parameters may be determined from the occurrence of different amino acids in different types of secondary structure in known protein structures. The prediction algorithm then contains the following steps:

1. Assign propensities to all residues in the sequence.

2. Scan the peptide and identify regions where 4 out of 6 contiguous residues exhibit P(α)>100, i.e. large probability for α-helix. Extend these nucleation regions in both directions until the average propensity for a set of four contiguous residues is P(α) < 100.

3. Scan the peptide and identify regions where 3 out of 5 contiguous residues have P(β)>100. These regions nucleate β-strands. Extend these in both directions until a set of four contiguous residues have an average P(β) < 100. This ends the β-strand - almost exactly the same as for the helices, but there might now be overlapping segments.

4. Any region containing overlapping α and β assignments are taken to be helical or β depending on if the average P(α) and P(β) for that region is largest. If this reduces an α- or β-region so that it becomes less than 5 residues, the α- or β-assignment for that region is removed.

5. To identify a tight β-turn at residue number i, the product f(turn) = f(i)f(i+1)f(i+2)f(i+3) is calculated. Remember that a tight turn uses two amino acids, and then there is one before and another after the turn too. These numbers are simply proportional to the frequency of each amino acids occurring in each such position of a β-turn.

6. To predict a β-turn, the following three conditions have to be simultaneously fulfilled:

(a) f(turn) > 0.000075, (b) the average value for the turn propensity p(turn) > 100 for the four amino acids and (c) the average p(turn) is larger than the average p(α) as well as p(β), i.e. it is more probable that we have a turn than any other structure.

7. The remaining parts of the sequence are considered coils.

Your task for this activity is to write a script in awk to implement the Chou-Fasman algorithm. You can find values for the Chou-Fasman parameters on the last page.

3-letter code

1-letter
code

p(a)

p(13)

p(t)

f(i)

f(i+1)

f(i+2)

f(i+3)

ALA

A

142

83

66

0.06

0.076

0.035

0.058

ARG

R

98

93

95

0.070

0.106

0.099

0.085

ASP

D

101

54

146

0.147

0.110

0.179

0.081

ASN

N

67

89

156

0.161

0.083

0.191

0.091

CYS

C

70

119

119

0.149

0.050

0.117

0.128

GLU

E

151

37

74

0.056

0.060

0.077

0.064

GLN

Q

111

110

98

0.074

0.098

0.037

0.098

GLY

G

57

75

156

0.102

0.085

0.190

0.152

HIS

H

100

87

95

0.140

0.047

0.093

0.054

ILE

I

108

160

47

0.043

0.034

0.013

0.056

LEU

L

121

130

59

0.061

0.025

0.036

0.070

LYS

K

114

74

101

0.055

0.115

0.072

0.095

MET

M

145

105

60

0.068

0.082

0.014

0.055

PHE

F

113

138

60

0.059

0.041

0.065

0.065

PRO

P

57

55

152

0.102

0.301

0.034

0.068

SER

S

77

75

143

0.120

0.139

0.125

0.106

THR

T

83

119

96

0.086

0.108

0.065

0.079

TRP

W

108

137

96

0.077

0.013

0.064

0.167

TYR

Y

69

147

114

0.082

0.065

0.114

0.125

VAL

V

106

170

50

0.062

0.048

0.028

0.053

Reference no: EM13705784

Previous Q& A

  How long in seconds would it take for the concentration

Question- The rate constant for this first order reaction is .830 s -1 at 400 C. How long in seconds would it take for the concentration of A to decrease from .690 M to .370M

  What was the mass percent of nh4c1 in the original sample

Question- You start with a 2.354 g sample of a mixture containing SiO,, NaCl, and NH,Cl. If after subliming away the NH,Cl, you have 1.975 grams of sample remaining, what was the mass percent of NH4C1 in the original sample

  Interval for the mean incubation period of sars virus

Based on interviews with 81 SARS patients, researchers found that the mean incubation period was5 days, with a standard deviation of 15.7 days. Based on this information construct a 95% confidence interval for the mean incubation period of SA..

  Define what is the molar mass of the solute

Question- A solution is prepared from 59.8 g of nonvolatile, nondissociating solute and 85.0 g of water. The vapor pressure of the solution at 60 degrees Celsius is 132 torr. The vapor pressure of water at 60 degrees Celsius is 150 torr. What is t..

  Two highest molecular ion peaks for bromobutane

Question- What will be the two highest molecular ion peaks for bromobutane. What will be their abundance

  Let x be the sample mean of a random sample of size n

Let X be the sample mean of a random sample of size n from a uniform distributionon the interval [0, b], and let bb = 2X be an estimator of the upper endpoint b.(a) Demonstrate that bb is an unbiased estimator of b, and find the variance of th..

  The solutions corresponding to the anode and the cathode

Question- A solid piece of Copper (Cu) was used as the electrode in each side of the cell and the answers to party a and b are the solutions corresponding to the anode and the cathode.

  What partial pressure of nitrogen gas is required

Question- What partial pressure of nitrogen (N2) gas is required in order for 0.00345g of the gas to dissolve in 14.0 mL of pure water. The Henry's law constant for nitrogen gas is 6.1 * 10^-4 atm -1.

  Explain the major product resulting from the addition

Discuss why hydroboration of 1-hexene provides the non-Markovnikov product and not the Markovnikov product. Contrast the major product of hydroboration (addition of H-OH) obtained in this experiment with the major product resulting from the additi..

  What are the equilibrium concentrations of each species

Question- Boric acid has pKa=9.23. What are the equilibrium concentrations of each species in .115M H3BO3 solution

Reviews

Write a Review

 

Similar Q& A

  Why are homologous characteristics

Why are homologous characteristics more useful than analogouscharacteristics in constructing phylogenetic trees? What types oftrees might result from using analogous characteristics, and why?

  Define the location of the mutation

To further discuss and define the location of the his mutation, the mutant strain was used as a recipient in a bacteriophage P1 transduction experiment.

  Which of the following is a population

Which of the following is a population.

  Describe the function of each of the following major

Describe the function (role) of each of the following major vegetative tissues in plants: epidermis, cortex, xylem, phloem, pith, mesophyll.

  Why is the heart described as a double pump describe the

why is the heart described as a double pump? describe the general structure of arteries and veins. how are they

  Q1 when patients who are receiving glucocorticoid therapy

q1. when patients who are receiving glucocorticoid therapy for in case with prednisone need to stop taking it the

  What is their optimal temperature and ph values

Explain why both B.coagulans and P.vulgaris are both facultative anaerobes? what is their optimal temperature and pH values?

  Choose three of the five main structures within the

select three of the five major structures within the integumentary system and complete the following skin sebaceous

  What is overall change in free energy of biochemical pathway

In a biochemical pathway, three ATP molecules are hydrolyzed. The endergonic reactions in the pathway require a total of 17.3 kcal/mole of energy to drive the reactions of the pathway.

  Mode of inheritance produces heterozygotes with phenotypes

Determine which mode of inheritance produces heterozygotes with phenotypes that differ from homozygote but more closely resembles one homozygous phenotype than other?

  Identify the main biological organic compounds

Identify the main biological organic compounds and identify and explain three functions of lipids and why lipids do not mix with water.

  Explain what occurs during the krebs

Explain what occurs during the Krebs (citric acid) cycle andelectron transport chain by describing the following.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd