Goal: Learn how to do statistics, sorting, and curve fitting in MATLAB. Put all of your ".m" files in a folder called yourname_lab7, zip the folder, and submit it on the class website.
Write a script called "yourname_lab7_1.m" that does the following. Create two vectors of 20 random integers each; in one the integers should range from 1 to 5, and in the other from 1 to 500. Would you expect the mean and median to be approximately the same? Would you expect the standard deviations of the two vectors to be approximately the same? Answer these questions in the comments of your program, and then use the built-in functions to find and print the minimum, maximum, mean, median, standard deviation, and mode of each vector. Draw a histogram of each vector in its own subplot of one figure. Run the script a few times to see how the results vary.
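As a starting point, the steps above might be sketched as follows. This is only one possible layout, not a required solution: the variable names (small, large, data, labels) are assumptions made for illustration, and the answers to the two questions are left for your own comments.

```matlab
% Minimal sketch; all variable names here are illustrative assumptions.
n = 20;
small = randi([1 5],   1, n);   % 20 random integers from 1 to 5
large = randi([1 500], 1, n);   % 20 random integers from 1 to 500

data   = {small, large};
labels = {'1 to 5', '1 to 500'};

figure;
for k = 1:2
    v = data{k};
    % Print the summary statistics using the built-in functions
    fprintf('Range %s: min=%d max=%d mean=%.2f median=%.1f std=%.2f mode=%d\n', ...
        labels{k}, min(v), max(v), mean(v), median(v), std(v), mode(v));
    subplot(1, 2, k);           % one panel per vector
    hist(v);                    % histogram(v) is the newer alternative
    title(['Integers ' labels{k}]);
end
```

Because randi draws new values on every run, the printed statistics and the histograms change each time the script is executed.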
Sort DNA sequences
DNA is a double-stranded helical polymer that contains basic genetic information in the form of patterns of nucleotide bases. The patterns of the base molecules A, T, C, and G encode the genetic information.
From the course web page, download the file "seq.txt", in which each line contains a short DNA sequence. In your script "yourname_lab7_2.m", import the data into a cell array using the command:
seqs = importdata('seq.txt');
1) Sort these sequences alphabetically.
2) Some DNA sequences are duplicated in the list. Find the most frequent sequence and the number of times it appears in the list. Output your answer nicely using the fprintf function.
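The two tasks above could be approached roughly as in the sketch below. To keep it self-contained, it uses a small made-up cell array in place of the contents of "seq.txt"; the sample sequences and the variable names (sortedSeqs, uniqueSeqs, counts, mostFrequent) are assumptions for illustration only. Counting with unique and accumarray is one option among several.

```matlab
% Hypothetical stand-in for seqs = importdata('seq.txt');
seqs = {'GATTACA'; 'ACGT'; 'TTAGC'; 'ACGT'; 'CCGA'; 'ACGT'; 'CCGA'};

% 1) Alphabetical sort (sort accepts a cell array of character vectors)
sortedSeqs = sort(seqs);

% 2) Count how often each distinct sequence occurs
[uniqueSeqs, ~, idx] = unique(seqs);   % idx maps each entry to its unique sequence
counts = accumarray(idx(:), 1);        % number of occurrences per unique sequence
[maxCount, best] = max(counts);        % on a tie, max returns the first maximum
mostFrequent = uniqueSeqs{best};

fprintf('Most frequent sequence: %s (appears %d times)\n', mostFrequent, maxCount);
```

Note that sort orders character vectors by character code, so uppercase letters come before lowercase ones; this matters only if the file mixes cases. If two sequences tie for the highest count, this sketch reports the one that comes first alphabetically.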