arachnode.net is a .NET web crawler written in C# that uses SQL Server 2008 and Lucene. It is released under the GPL.
Applications

Content Aggregation: Use for personal content aggregation, crawling intranets of any size, or crawling the Internet as a whole. Discovered content is parsed and stored in multiple configurable forms and locations.

Research and Analysis: Extract, collect and parse downloaded content into multiple forms, including XML. SSIS packages and Common Language Runtime (CLR) functions extract terms and phrases from text content, and more than 250 stored procedures, views and functions are provided to jumpstart SQL Server Analysis Services or other text mining applications.

Search: Discovered content is stored in Lucene indexes and can be searched through a familiar Web interface.

Text Mining: Extract words, phrases, tags and text from discovered content.

Education: Learn introductory to advanced crawling techniques, along with features of the .NET Framework and SQL Server 2008, including full-text indexing, multi-threading, caching, reflection, interfaces, object-oriented concepts, SQL common language runtime functions and regular expressions.
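The text mining application described above (extracting words and phrases from discovered content) can be illustrated with a minimal sketch. This is not arachnode.net code or its API: it is written in Python rather than C#, the function names `extract_terms` and `extract_phrases` are hypothetical, and the tag stripping is a crude regular expression that assumes simple markup with no script or style blocks.

```python
import re
from collections import Counter

# Hypothetical helpers for illustration only; not part of arachnode.net.
TAG_RE = re.compile(r"<[^>]+>")             # crude match for HTML tags
WORD_RE = re.compile(r"[A-Za-z][A-Za-z'-]*")  # words starting with a letter


def tokenize(html: str) -> list[str]:
    """Strip tags and return the lowercased words of the remaining text."""
    text = TAG_RE.sub(" ", html)
    return [word.lower() for word in WORD_RE.findall(text)]


def extract_terms(html: str) -> Counter:
    """Count how often each word occurs in the page text."""
    return Counter(tokenize(html))


def extract_phrases(html: str, n: int = 2) -> Counter:
    """Count runs of n consecutive words (n-grams) in the page text."""
    words = tokenize(html)
    return Counter(
        " ".join(words[i:i + n]) for i in range(len(words) - n + 1)
    )


if __name__ == "__main__":
    page = "<p>Crawl the web, crawl the index.</p>"
    print(extract_terms(page).most_common(2))
    print(extract_phrases(page).most_common(1))
```

A production crawler would normally use a real HTML parser instead of a regular expression for tag removal, and would store the resulting counts in a database or index rather than in memory.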