Implementation of huffman coding, Advanced Statistics

Assignment Help:

Input to the compress is a text le with arbitrary size, but for this assignment we will assume that the data structure of the file fits in the main memory of a computer. Output of the program is a compressed representation of the original file. You will have to save the codetable in the header of the compressed file, so that you can use the codetable for decompressing the compressed file. Input to the decompress is a compressed file, from which the program recovers the original file. For sanity

check, you should have a specific magic word at some position in the header of the compressed file, so that decompress can identify whether the given file is a valid Huffman compressed file. You should pay attention to the following issues:

The file that we will use for testing can be very large, having size in Gigabytes, so make sure that your program is bug-free and it works for large input le.

Write efficient algorithm, we will take off as much as 20 points if we feel that the program is taking unusually long time.

You must make sure that your program runs on a Linux Machine, and identically follows the formatting instructions. For formatting error, as much as 15 points can be taken off .

You must provide a Make file to compile your programs. Also, a README.txt le should be provided that will have the instruction to compile and run the programs. 


Related Discussions:- Implementation of huffman coding

Dummy variables, The variables resulting from the recoding categorical vari...

The variables resulting from the recoding categorical variables with more than two categories into the sequence of binary variables. Marital status, for instance, if originally lab

Doubly ordered contingency tables, The contingency tables in which the row ...

The contingency tables in which the row and column both the categories follow a natural order. An instance for this might be, drug toxicity ranging from mild to severe, against the

EDUC 606, The GRE has a combined verbal and quantitative mean of 1000 and a...

The GRE has a combined verbal and quantitative mean of 1000 and a standard deviation of 200.

Two - stage distillation process, A mixture of benzene, toluene, and xylene...

A mixture of benzene, toluene, and xylene enters a two-stage distillation process where some of the componentsare recovered. The distillation process operates at steady-state condi

Differences total spot, The graphical process most frequently used in the a...

The graphical process most frequently used in the analysis of data from a two-by-two crossover design. For each of the subject the difference between the response variable values o

Blinder oaxaca method, Blinder Oaxaca method: A method or technique used f...

Blinder Oaxaca method: A method or technique used for assessing the effect of the role of income on racial wealth gap. The method or technique is based on the decomposition of the

Contour plot, Contour plot : A topographical map drawn from data comprising...

Contour plot : A topographical map drawn from data comprising observations on the three variables. One variable is represented on horizontal axis and the second variable is represe

Conditional probability, Conditional probability : The probability that an ...

Conditional probability : The probability that an event occurs given the outcome of other event. Generally written, Pr(A|B). For instance, the probability of a person being color b

Explain missing values, Missing values : The observations missing from the ...

Missing values : The observations missing from the set of data for some of the reason. In longitudinal studies, for instance, they might occur because subjects drop out of the stud

Write Your Message!

Captcha
Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd