The Basic Parts Of A Search Engine

Posted On: Wed, November 3rd, 2010
Authored By:  daksh

While there are different ways to organize web content, every crawling search engine has the same basic parts:

Crawler (or Spider)
The crawler does just what its name implies. It scours the web by following links, updating pages it already knows about, and adding new pages when it comes across them. Each search engine has periods of deep crawling and periods of shallow crawling. There is also a scheduler mechanism that prevents a spider from overloading servers and tells the spider which documents to crawl next and how frequently to crawl them.
Rapidly changing or highly important documents are more likely to get crawled frequently. Crawl frequency should typically have little effect on search relevancy; it simply helps the search engines keep fresh content in their index. A popular, rapidly growing forum might get crawled a few dozen times each day. A static site with little link popularity and rarely changing content might only get crawled once or twice a month. The main benefit of having a frequently crawled page is that you can get your new sites, pages, or projects crawled quickly by linking to them from a powerful or frequently changing page.
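
The scheduler idea can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not how any real search engine works: the class name `CrawlScheduler`, the `host_delay` parameter, the recrawl intervals, and the example URLs are all hypothetical. Each URL gets its own recrawl interval, and a per-host delay keeps the spider from hitting one server too often.

```python
import heapq
from urllib.parse import urlparse


class CrawlScheduler:
    """Toy scheduler: picks the next URL to crawl and when to revisit it."""

    def __init__(self, host_delay=5.0):
        self.host_delay = host_delay  # assumed minimum seconds between hits on one host
        self._queue = []              # heap of (due_time, url)
        self._interval = {}           # url -> seconds between recrawls
        self._host_ready = {}         # host -> earliest time it may be hit again

    def add(self, url, interval, now=0.0):
        """Register a URL that is due immediately and recrawled every `interval` seconds."""
        self._interval[url] = interval
        heapq.heappush(self._queue, (now, url))

    def next_url(self, now):
        """Return a due URL whose host is not being overloaded, or None."""
        deferred = []
        chosen = None
        while self._queue and self._queue[0][0] <= now:
            due, url = heapq.heappop(self._queue)
            host = urlparse(url).netloc
            if self._host_ready.get(host, 0.0) <= now:
                chosen = url
                self._host_ready[host] = now + self.host_delay
                # Schedule the next visit according to this URL's interval.
                heapq.heappush(self._queue, (now + self._interval[url], url))
                break
            deferred.append((due, url))  # host is busy; try again later
        for item in deferred:
            heapq.heappush(self._queue, item)
        return chosen


scheduler = CrawlScheduler(host_delay=5.0)
scheduler.add("http://forum.example/latest", interval=3600)       # busy page: hourly
scheduler.add("http://static.example/", interval=15 * 24 * 3600)  # static site: about twice a month
```

Calling `scheduler.next_url(now)` repeatedly with the current time hands out due pages one at a time, and a rapidly changing page simply comes due more often than a static one.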

The Index
The index is where the spider-collected data are stored. When you perform a search on a major search engine, you are not searching the web, but the cache of the web provided by that search engine’s index.
Reverse Index - Search engines organize their content in what is called a reverse index (more commonly known as an inverted index). A reverse index sorts web documents by words. When you search Google and it displays 1-10 out of 143,000 websites, it means that there are approximately 143,000 web pages that either have the words from your search on them or have inbound links containing them. Also, note that search engines do not store punctuation, just words.
Storing Attributes - Since search engines view pages from their source code in a linear format, it is best to move JavaScript and other extraneous code to external files to help move the page copy higher in the source code. Some people also use Cascading Style Sheets (CSS) or a blank table cell to place the page content ahead of the navigation. To decide which words come first, search engines look at the order in which the words appear in the source code. I have not done significant testing to determine if it is worth the effort to make your unique page code appear ahead of the navigation, but if it does not take much additional effort, it is probably worth doing. Link analysis (discussed in depth later) is far more important than page copy to most search algorithms, but every little bit can help.
As well as storing the position of a word, search engines can also store how the data are marked up. For example, is the term in the page title? Is it a heading? What type of heading? Is it bold? Is it emphasized? Is it part of a list? Is it in link text?
Words that are in a heading or are set apart from normal text in other ways may be given additional weighting in many search algorithms. However, keep in mind that it may be an unnatural pattern for your keyword phrases to appear many times in bold and headings without occurring in any of the regular textual body copy. Also, if a page looks like it is aligned too perfectly with a topic, then that page may get a lower relevancy score than a page with a lower keyword density and more natural page copy.
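
To make the idea of an index that sorts documents by words more concrete, here is a minimal Python sketch. It is an illustration only: the function names, the sample documents, and especially the `FIELD_WEIGHTS` values are assumptions made for this example, since real search engines do not publish their weightings. The index maps each word to the documents it appears in, along with its position and the markup field it was found in, and punctuation is dropped during tokenizing.

```python
import re
from collections import defaultdict

# Hypothetical weights for where a word appears; chosen only for illustration.
FIELD_WEIGHTS = {"title": 3.0, "heading": 2.0, "body": 1.0}


def tokenize(text):
    """Lowercase the text and keep only runs of letters and digits (no punctuation)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(docs):
    """docs: {doc_id: {field: text}} -> {word: [(doc_id, field, position), ...]}"""
    index = defaultdict(list)
    for doc_id, fields in docs.items():
        for field, text in fields.items():
            for position, word in enumerate(tokenize(text)):
                index[word].append((doc_id, field, position))
    return index


def search(index, query):
    """Return ids of documents containing every query word, best score first."""
    words = tokenize(query)
    scores = defaultdict(float)
    matched = defaultdict(set)
    for word in words:
        for doc_id, field, _position in index.get(word, []):
            scores[doc_id] += FIELD_WEIGHTS.get(field, 1.0)
            matched[doc_id].add(word)
    hits = [doc_id for doc_id in scores if matched[doc_id] == set(words)]
    return sorted(hits, key=lambda doc_id: scores[doc_id], reverse=True)


docs = {
    "a": {"title": "Search engine basics",
          "body": "How a search engine crawls, indexes, and ranks."},
    "b": {"title": "Gardening tips",
          "body": "Use a search engine to find seeds."},
}
index = build_index(docs)
print(search(index, "search engine"))  # ['a', 'b']: the title matches lift document "a"
```

A query never scans the documents themselves. It only looks up each word in the index, which is why a search runs against the engine's stored copy of the web rather than the live web.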

Search Interface
The search algorithm and search interface are used to find the most relevant documents in the index based on the search query. First the search engine tries to determine user intent by looking at the words the searcher typed in.
These terms can be stripped down to their root level and checked against a lexical database to see what concepts they represent. Terms that are a near match will help you rank for other similarly related terms. For example, using the word swims could help you rank well for swim or swimming.
Search engines can try to match keyword vectors with each of the specific terms in a query. If the search terms occur near each other frequently, the search engine may understand the phrase as a single unit and return documents related to that phrase.
WordNet is the most popular lexical database. At the end of this chapter there is a link to a Porter Stemmer tool if you need help conceptualizing how stemming works.
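
As a rough way to picture stemming, the Python sketch below strips a few common suffixes so that swims, swimming, and swim all reduce to the same root. It is a deliberately crude stand-in and not the Porter algorithm, which applies a much longer and more careful set of rules. The function names and the suffix list are assumptions made for this example.

```python
SUFFIXES = ("ing", "ed", "es", "s")  # illustrative only, checked in this order


def crude_stem(word):
    """Strip one common suffix and collapse a doubled final consonant."""
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            stem = word[: -len(suffix)]
            # "swimm" -> "swim"; leave endings such as "ll" and "ss" alone.
            if len(stem) >= 2 and stem[-1] == stem[-2] and stem[-1] not in "aeiouls":
                stem = stem[:-1]
            return stem
    return word


def matches(query, document_text):
    """True if every query word shares a stem with some word in the document."""
    doc_stems = {crude_stem(w) for w in document_text.split()}
    return all(crude_stem(w) in doc_stems for w in query.split())


print(crude_stem("swims"), crude_stem("swimming"), crude_stem("swim"))  # swim swim swim
print(matches("swimming", "She swims daily"))  # True
```

Because the query and the page copy are reduced to the same root before they are compared, a page that only uses swims can still match a search for swimming.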

About the Author:

I'm James, and I like to publish this useful information. Currently, I am working on link popularity building services such as directory submission, article submission, and social bookmarking.

Keywords: search engine, seo, website

Source: http://www.freearticlesinc.com/view_article-id-26380-at-The Basic Parts Of A Search Engine.html
