Extract data from Web sites using regular expressions.
WebExtractor360 is a free and open source web data extractor. It allows you to extract Images, Phrases, HTML Headers, HTML Tables, URLs (Links), URLs (Keywords), Emails, Phone, Fax and ANY other information on the web by specifying a Regular Expression. The web extractor software starts by crawling the specified web URL or any local file resource. All data that matches the Match (Regular Expression) field will be returned as a result. Upon completion of the matching process for the specified URL, the crawler will continue to process other URLs that the specified URL links to. The entire process is repeated until the Maximum URL limit has been reached or there are no more URLs to process. This version is the first release on CNET Download.com.
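The crawl-and-match loop described above can be sketched roughly as follows. This is a minimal illustration in Python, not WebExtractor360's actual code: the names `crawl_and_extract`, `fetch`, `LINK_RE`, and `max_urls` are hypothetical, and the link finder is a deliberately simple regular expression rather than a full HTML parser.

```python
import re
from collections import deque
from urllib.parse import urljoin
from urllib.request import urlopen

# Simplistic link finder (assumption): captures the value of href="..." attributes.
LINK_RE = re.compile(r'href=["\'](.*?)["\']', re.IGNORECASE)


def fetch(url):
    """Download a page and return its text (hypothetical helper)."""
    with urlopen(url, timeout=10) as resp:
        return resp.read().decode("utf-8", errors="replace")


def crawl_and_extract(start_url, pattern, max_urls=10, fetch_page=fetch):
    """Return (url, match) pairs for every regex match found while crawling.

    Starts at start_url, collects matches on each page, then follows the
    links on that page. Stops once max_urls pages have been processed or
    no unvisited URLs remain.
    """
    regex = re.compile(pattern)
    queue = deque([start_url])   # URLs waiting to be processed
    seen = {start_url}           # URLs already queued, to avoid loops
    results = []
    processed = 0

    while queue and processed < max_urls:
        url = queue.popleft()
        processed += 1
        try:
            html = fetch_page(url)
        except (OSError, ValueError):
            continue  # unreachable or malformed URL: skip it

        # Everything that matches the pattern is returned as a result.
        results.extend((url, m.group(0)) for m in regex.finditer(html))

        # Queue the pages this page links to.
        for href in LINK_RE.findall(html):
            link = urljoin(url, href)
            if link not in seen:
                seen.add(link)
                queue.append(link)

    return results
```

Calling `crawl_and_extract("https://example.com/", r"[\w.+-]+@[\w-]+\.[\w.]+", max_urls=5)` would look for email-like strings on up to five pages. That pattern is only a rough approximation of an email address, and the quality of the results depends entirely on the regular expression supplied.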
Extraction is either nonexistent or a jumbled mess.
mccurry.brian
Pros
None that I saw.
Cons
Extracting phone numbers gave me a bunch of numbers and symbols nowhere to be found on the website.
Extracting the table returned nothing.
The only thing I could get it to return that was accurate whatsoever was the title of the website.
Summary
Not worth downloading until they put a lot of work into this.
It works great for me!
Aeromium
Pros
One of the things I like is that it lets you configure all kinds of information to extract (using regular expressions). In my opinion, it is better than most commercial products I have used before.
Cons
Over the years, many websites have attempted to hide email addresses from extraction. This should be the next hurdle for them. So far, I have not seen a commercial product that can effectively resolve this.
Summary
Easy to use. Caters to advanced users like me. Open source, which means it is FREE! :)
Not worth downloading
markpaulsherman
Pros
Quick to install and gives you the excited feeling that this is a good product.
Cons
The email and fax features don't work. I even tested it on my own website and it didn't find anything.