Friday, January 13, 2012

Vertical search

Vertical search is targeted at a particular industry professional search engine, search engines are broken down and extended, the webpage database is some kind of specialized information integrate a directional branch field, extract the needed data to be processed and then in some form is returned to the user. Relatively general search engines with large amount of information, the query is not accurate, not enough depth, put forward the new search engine service mode, through targeted to a specific domain, a specific group or a specific demand to provide some valuable information and related services. Its characteristic is "specialized, refined, deep", and with industry of color, compared to the general search engine information disordering, vertical search engine is more focused, specific and thorough.
Brief introduction
Vertical search engine is relatively general search engines with large amount of information, the query is not accurate, not enough depth, put forward the new search engine service mode, through targeted to a specific domain, a specific group or a specific demand to provide some valuable information and related services. Its characteristic is "specialized, refined, deep", and with industry of color, compared to the general search engine information disordering, vertical search engine is more focused, specific and thorough.
Vertical search engine and general webpage search difference
Vertical search engine and ordinary webpage search engine is the biggest difference is to webpage information in a structured information extraction, namely the webpage of the unstructured data into structured data extraction of specific information, such as webpage search is a webpage for the smallest unit, based on the visual analysis in the webpage webpage block block is the smallest unit, while the vertical the search is based on structured data is the smallest unit. Then these data are stored into a database, for further processing, such as: weight, classification, the last word, index to search the way to meet the needs of users. The entire process, data from unstructured data into structured data extraction, after deep processing unstructured and structured way back to the user.
Microsoft Research Institute, a technical expert once said:" the 75% content using the search engine". Vertical search engines birth is to greatly improve the search" recall" and "precision". Vertical search engine based on the industry in the field of information model and user model structured collection or organization, and provide more and more professional, personalized industry related services.
Application direction
The application of vertical search engine direction, such as the enterprise library search, supply and demand information search, shopping search, property search, talent search, map search, MP3 search, image search ... ... Almost all walks of life, all kinds of information can be further refined into various types of vertical search engine. An example would be easier to understand, such as shopping search engine, the whole process is as follows: grab the webpage, the webpage commodity information extraction, extract the name of commodity, price, description ... ... Even can be further subdivided into " brand, notebook models, memory, hard disk, CPU, screen, ... ..." Then the information for cleaning, to heavy, classification, comparison and analysis, data mining, finally through the word index provide users to search, through the analysis of mining to provide market reports.
Technology
Vertical search engine generally requires the following technology
1 search engine spiders crawl the Internet: the relevant webpage
2 webpage structured information extraction or metadata acquisition technology from the webpage: extracting structured data
3 word segmentation, indexing: storage and index data
4 data shows: as the stored data are not simple webpage data, need to consider according to industry demand for display
5 other information processing technology

No comments:

Post a Comment