This CGI script spiders your website to index all of its HTML files; you can then
run the parser to convert the HTML to raw text. It is a good way to build information
databases free of HTML tags when constructing search engines, and a useful analytical
tool for search engine studies such as keyword density. The parsing subroutine can be
reused by the webmaster for many other purposes, and the tool is useful for integrating
with linking scripts such as Gossamer Threads. In short, it extracts the core data
from HTML and turns it into simple text.
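To make the HTML-to-text step concrete, here is a minimal sketch of tag stripping. It is written in Python for illustration only and is not the script's own Perl code; the names `TextExtractor` and `html_to_text` are hypothetical, and skipping `<script>` and `<style>` contents is an assumption about what "raw text" should exclude.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects text content, skipping <script> and <style> bodies (an assumption)."""

    SKIPPED = {"script", "style"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.chunks = []      # text fragments found between tags
        self.skip_depth = 0   # > 0 while inside a skipped element

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self.skip_depth > 0:
            self.skip_depth -= 1

    def handle_data(self, data):
        if self.skip_depth == 0:
            self.chunks.append(data)


def html_to_text(html):
    """Return the text of an HTML string with tags removed."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    # Join fragments, then collapse the whitespace left behind by removed tags.
    return " ".join(" ".join(parser.chunks).split())


if __name__ == "__main__":
    print(html_to_text("<p>Hello <b>world</b></p>"))
```

Fragments are joined with a space so that text from adjacent block elements does not run together; the cost is that punctuation following an inline tag may gain a leading space.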
System Requirements
Perl 5
Telnet recommended
Features
HTML parsed and tags removed
Sorts unique words
Counts unique words
Captures meta keywords
Captures meta descriptions
Captures TITLE
Creates text summaries of webpages
Automatic spidering or manual entry allows you to parse only the files you choose
Great tool for webmasters for many purposes
Creates keyword ranking numbers for search engine studies
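
Several of the features above (capturing the TITLE and meta tags, sorting and counting unique words, keyword ranking) can be sketched roughly as follows. This is an illustrative Python sketch, not the script's Perl implementation; `PageInfo`, `page_info`, and `word_stats` are hypothetical names, and treating a keyword's ranking number as its count divided by the total word count is an assumption about how density is measured.

```python
import re
from collections import Counter
from html.parser import HTMLParser


class PageInfo(HTMLParser):
    """Captures the <title> text and the keywords/description meta tags."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.title = ""
        self.meta = {}
        self.in_title = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "title":
            self.in_title = True
        elif tag == "meta":
            name = (attrs.get("name") or "").lower()
            if name in ("keywords", "description"):
                self.meta[name] = attrs.get("content") or ""

    def handle_endtag(self, tag):
        if tag == "title":
            self.in_title = False

    def handle_data(self, data):
        if self.in_title:
            self.title += data


def page_info(html):
    """Parse an HTML string and return the populated PageInfo object."""
    parser = PageInfo()
    parser.feed(html)
    parser.close()
    return parser


def word_stats(text):
    """Return (sorted unique words, ranking of (word, count, density))."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    counts = Counter(words)
    total = len(words)
    # Density here is count / total words (an assumed definition).
    ranking = [(w, n, n / total) for w, n in counts.most_common()]
    return sorted(counts), ranking


if __name__ == "__main__":
    unique, ranking = word_stats("Perl parses HTML. Perl counts words.")
    print(unique)
    print(ranking[0])
```

`word_stats` expects text that has already had its tags removed, so it can be applied to the output of whatever parsing step is used.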