Inout spider is a web spider script that crawls websites to generate data for small/medium sized search engines. This superb web crawler has been developed by Nesote Technologies as part of a series of web scripts including a number of popular clone scripts.
Inout spider is a powerful script built on Hypertable database under Hadoop file system. Inout Spider is capable of crawling and indexing large amounts of data and run queries with high speed and great accuracy to establish unique databases. The design department gets a wide range of challenging requests every day. There is a highly competitive team of designers capable of producing outputs that exceed your expectations.
With Inout Spider, you will have the tool needed to Crawl and build your own database
- Scalable Architectural Design
- The web Spider provides search results as XML instead of HTML results. It gives you great flexibility over the results. You may parse these XML results using your own PHP/ASP.NET/JSP program. Also, Inout Search Engine work seamlessly with Inout Spider to expose all the features/capabilities of this, by just configuring the Spider install path in Inout search engine admin area.
- The Spider web search always find the most relevant matched results available in your database for the given keywords. Right now, Inout Spider can handle Web, Image and News searches.
Image SearchInout Spider is one of the finest and rarest spider software’s available on the internet which provides image search. Inout Spider’s crawler will crawl images from the added domains while crawling the HTML pages and it tags various keywords and other parameters related to the images. The spider saves a thumbshot of the image in the database. The dimensions of the thumb shots can be configured by the administrator.
- Result Caching
Upon a search query, the Query Analyser [Please see the tutorial] module of Inout Spider will check the database for a result cache. If it finds a result there, it will immediately get the results back to the requester. If a pre-calculated recent result is not available already, ‘result generator’ will quickly identify and generate the results from the pages crawled, store in cache database, and send back to the requester. Result caching will help you to get instant search results for popular keywords.
- Global/Individual Domain Depth Control
Inout Spider helps you to control the crawling process by specifying the domain page depth limit. You can specify the settings globally for all domains or specifically for some selected domains. It will help you to make sure that the spider resources are utilized the way you want it to be.
- Seamless Integration with Inout Search Engine
Inout Spider is designed to be compatible with all third-party search engines however, Inout Search Engineis designed to integrate and work with Inout Spider seamlessly. This pairing is recommended if script customization is part of your requirement.
- Unlimited Custom Result Channel
Apart from web, image and news results, you may create a number of search channels like Script Search, Soccer Search, Wikipedia Search etc, with the help of domain sets and categories.
- Domain Sets
Inout Spider allows you to define a domain set in your spider admin area. Each page the Spider Bots crawl will be verified against the domain sets, and if it finds a match, it will tag the page to the corresponding domain set. You may later filter/retrieve your web/image/news results based on a domain set. It will help you to create a service (with a group of websites) specific search channel.
Similar to domain sets, Inout Spider allows you to define categories from the admin area. You may define the keywords related to a category so that if the Spider-bot finds a match with the keyword, it will tag the page to the corresponding category. You may later filter your web/image/news results based on the categories to create your desired channel.
- Intelligent Result Identifier System
- Page Rank
Inout Spider determines a page rank for each page. Also, the Spider crawls are based on many factors like incoming/outgoing links, page depth, domain priority, etc.
- Family Filter
Inout Spider allows you to configure family filter setting from your admin area to perform screening. The family filter is easy to manage and it will help you to retrieve results based on the family filter condition you want.
- API Keys