3.4.3
Search Engine Indexing
Search Engine Indexing
Search Engine Indexing
In order for search engines to find useful results when users carry out web searches, they have to store details of as many web pages as possible.
Indexing
Indexing
- In order for search engines to find useful results when users carry out web searches, they have to store details of as many web pages as possible.
- Search engine indexing refers to the methods used to maintain such a database.
- Search engine indexing allows results to be found efficiently.
Web crawlers
Web crawlers
- Search engines make use of software known as web crawlers, which traverse the web by visiting web pages and following the links contained on each.
- The web crawlers store key information about each page in an index.
- This is then used by the search engine to find results for a web user’s search.
Indexed information
Indexed information
- The information stored may include:
- Keywords in the webpage.
- Meta tags within the code of the page.
- How recently the website is updated.
PageRank Algorithm
PageRank Algorithm
The PageRank algorithm is a method of ranking the web pages in a set of Google search results.
PageRank algorithm
PageRank algorithm
- PageRank is named after Larry Page who, along with fellow Stanford University student Sergey Brin, developed the algorithm in the mid-1990s.
- The algorithm judges how important each web page is.
- The higher the score for a particular page, the closer to the top of the list of results it appears.
Operation
Operation
- Google searches often contain millions of results, so PageRank is an important method of ensuring useful pages are more prominent in the list.
- The algorithm has to be run constantly for each web page as the number of external links to the page can change at any time.
Algorithm
Algorithm
- The PageRank algorithm is written as follows:
- Using ‘Web page A’ as an example:
- is to the PageRank of ‘Web page A’.
- is the PageRank of any of pages which link to ‘Web page A’.
- is the number of outbound links on the pages which link to ‘Web page A’.
- is the damping factor, which is usually set to 0.85.
1Components of a Computer
1.1Structure & Function of the Processor
1.2Types of Processors
2Software & Software Development
2.1Systems Software
2.2Applications Generation
2.3Software Development
3Exchanging Data
3.1Compression, Encryption & Hashing
3.3Networks
4Data Types, Data Structures & Algorithms
4.1Data Types
5Legal, Moral, Cultural & Ethical Issues
5.1Computing Related Legislation
6Elements of Computational Thinking
6.1Thinking Abstractly
6.2Thinking Procedurally
6.3Thinking Logically
7Problem Solving & Programming
7.1Programming Techniques
7.2Programming Construction
Jump to other topics
1Components of a Computer
1.1Structure & Function of the Processor
1.2Types of Processors
2Software & Software Development
2.1Systems Software
2.2Applications Generation
2.3Software Development
3Exchanging Data
3.1Compression, Encryption & Hashing
3.3Networks
4Data Types, Data Structures & Algorithms
4.1Data Types
5Legal, Moral, Cultural & Ethical Issues
5.1Computing Related Legislation
6Elements of Computational Thinking
6.1Thinking Abstractly
6.2Thinking Procedurally
6.3Thinking Logically
7Problem Solving & Programming
7.1Programming Techniques
7.2Programming Construction
Unlock your full potential with GoStudent tutoring
Affordable 1:1 tutoring from the comfort of your home
Tutors are matched to your specific learning needs
30+ school subjects covered