Indexing a page on the Internet, Basic Computer Science

Indexing a page on the Internet:

Once the spiders have finished gathering information about Web pages, the search engine must store that information in a way that makes it usable. The search engine may also indicate how relevant each result is, for example in the form of a ranking. Thus, a search engine may store the keywords of a Web page, the number of times each word appears on the page, and the URL of the page. It may also store a weighting factor that gives more weight to a word found near the top of the document. Each commercial search engine uses a different formula for assigning weight to the keywords in its index. This is one of the reasons that a search for the same word on different search engines produces different results. Because the amount of data to be stored for indexing is large, the search engine may encode it to save space. The index is created for a single purpose: to let you find information on the Internet quickly. In general, an index uses hashing (you will study this concept in the Data Structure course).
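The idea can be illustrated with a small sketch. It is not the formula of any real search engine: the names index_page and search, the parameters top_words and top_bonus, and the example URLs are all made up for illustration. The sketch stores, for each keyword, the URL of each page containing it, the number of times the word appears there, and a weight that counts occurrences near the top of the page more heavily. It relies on Python's dict, which is implemented as a hash table, so looking up a keyword does not require scanning every page.

```python
import re
from collections import defaultdict


def index_page(index, url, text, top_words=10, top_bonus=2.0):
    """Add one page to the index.

    index maps keyword -> {url: {"count": n, "weight": w}}.
    top_words and top_bonus are hypothetical tuning parameters: an
    occurrence among the first `top_words` words of the page adds
    `top_bonus` to the weight; any later occurrence adds 1.0.
    """
    words = re.findall(r"[a-z0-9]+", text.lower())
    for position, word in enumerate(words):
        entry = index[word].setdefault(url, {"count": 0, "weight": 0.0})
        entry["count"] += 1
        entry["weight"] += top_bonus if position < top_words else 1.0


def search(index, word):
    """Return the URLs that contain `word`, highest weight first."""
    postings = index.get(word.lower(), {})
    return sorted(postings, key=lambda url: postings[url]["weight"], reverse=True)


# keyword -> per-page statistics; the dict lookup is a hash lookup
index = defaultdict(dict)
index_page(index, "http://example.com/a",
           "Spiders crawl the web. Later, a search engine indexes pages about spiders.",
           top_words=3)
index_page(index, "http://example.com/b",
           "A search engine stores keywords so that a search is fast.",
           top_words=3)

print(search(index, "search"))
```

Here the second page is listed first for the word "search", because the word appears twice on it and one of those occurrences is near the top. A different choice of top_words or top_bonus would change the weights, which is the same reason different search engines can return different results for the same word.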

Posted Date: 10/23/2012 6:28:24 AM | Location : United States






