Implementation of huffman coding, Advanced Statistics

Assignment Help:

Input to the compress is a text le with arbitrary size, but for this assignment we will assume that the data structure of the file fits in the main memory of a computer. Output of the program is a compressed representation of the original file. You will have to save the codetable in the header of the compressed file, so that you can use the codetable for decompressing the compressed file. Input to the decompress is a compressed file, from which the program recovers the original file. For sanity

check, you should have a specific magic word at some position in the header of the compressed file, so that decompress can identify whether the given file is a valid Huffman compressed file. You should pay attention to the following issues:

The file that we will use for testing can be very large, having size in Gigabytes, so make sure that your program is bug-free and it works for large input le.

Write efficient algorithm, we will take off as much as 20 points if we feel that the program is taking unusually long time.

You must make sure that your program runs on a Linux Machine, and identically follows the formatting instructions. For formatting error, as much as 15 points can be taken off .

You must provide a Make file to compile your programs. Also, a README.txt le should be provided that will have the instruction to compile and run the programs. 


Related Discussions:- Implementation of huffman coding

Clinical vs. statistical significance, Clinical vs. statistical significanc...

Clinical vs. statistical significance : The distinction among results in terms of their possible clinical importance rather than simply in terms of their statistical importance. Wi

Latin square, Latin square  is an experimental design targeted at removing ...

Latin square  is an experimental design targeted at removing from the experimental error the variation from two extraneous sources so that a more sensitive test of the treatment ef

Construct a stem-and-leaf diagram, The number of employees absent from work...

The number of employees absent from work at a large electronics manufacturing plant over aperiod of 106 days is given in the table below. 146 141 139 140 145 141 142 131 142 140

Unequal probability sampling, Unequal probability sampling is the sampling...

Unequal probability sampling is the sampling design in which the different sampling units in the population have different probabilities of being included in sample. The differing

Statistical & Quantitative Methods , Given: There are 4 jobs and 4 persons...

Given: There are 4 jobs and 4 persons. The cost incurred for each person and each job is as follows: Persons Job 1 Job 2 Job 3 Job 4 A 10 9 21 11 B 15 12 25 17 C 12 10 20 12 D 17

Historigram, difference between histogram and historigram

difference between histogram and historigram

Explain randomized response technique, Randomized response technique : The ...

Randomized response technique : The procedure for collecting the information on sensitive issues by means of the survey, in which an element of chance is introduced as to what quer

Computer-assisted interviews, Computer-assisted interviews : A method or te...

Computer-assisted interviews : A method or technique of interviewing subjects in which the interviewer reads the question from the computer screen instead of the printed page, and

Decision theory, A unified approach to all problems of prediction, estimati...

A unified approach to all problems of prediction, estimation, and hypothesis testing. It is based on concept of the decision function, which tells the performer of experiment how t

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd