Meatball Wiki: WebSphinx


MeatballWiki | RecentChanges | Random Page | Indices | Categories

WebSPHINX (Website-Specific Processors for HTML INformation eXtraction) is a Java class library and interactive development environment for Web crawlers. A WebCrawler (also called a robot or spider) is a program that browses and processes Web pages automatically.

http://www.cs.cmu.edu/~rcm/websphinx/
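
To make "browses and processes Web pages automatically" concrete, here is a minimal sketch of the core crawl loop. It is not WebSPHINX code and does not use its API: the class name CrawlSketch, the crawl method, and the in-memory link map standing in for real pages are all assumptions made for illustration, and no network requests are made. The maxPages cap is likewise an assumption, included because an unbounded crawler can easily overload a server.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical illustration only: not part of WebSPHINX, and no network I/O.
class CrawlSketch {

    // Visits pages breadth-first starting at `start`, following each page's
    // outgoing links, and stops after at most `maxPages` pages.
    // `links` maps a page name to the pages it links to (a stand-in for
    // fetching a page and extracting its hyperlinks).
    static List<String> crawl(Map<String, List<String>> links, String start, int maxPages) {
        List<String> visited = new ArrayList<>();   // pages in the order processed
        Set<String> seen = new HashSet<>();         // pages already queued
        Deque<String> frontier = new ArrayDeque<>();

        frontier.add(start);
        seen.add(start);

        while (!frontier.isEmpty() && visited.size() < maxPages) {
            String page = frontier.remove();
            visited.add(page);                      // "process" the page

            for (String next : links.getOrDefault(page, List.of())) {
                if (seen.add(next)) {               // true only for unseen pages
                    frontier.add(next);
                }
            }
        }
        return visited;
    }

    public static void main(String[] args) {
        // A made-up four-page site.
        Map<String, List<String>> site = Map.of(
            "index.html", List.of("a.html", "b.html"),
            "a.html", List.of("b.html", "index.html"),
            "b.html", List.of("c.html"));

        // prints [index.html, a.html, b.html, c.html]
        System.out.println(crawl(site, "index.html", 10));
    }
}
```

The seen set is what keeps the crawler from looping forever on pages that link back to each other; a real crawler would replace the map lookup with an HTTP fetch and link extraction.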

The Crawler Workbench is a Java applet that puts a customizable Web crawler right in your browser. Using the Crawler Workbench, you can:

[See it in action!] But please be careful, as it hammers the server. (Link now defunct.)


Source code is available at the above link.


CategoryInformationVisualization

