From Tortuga: Gather intelligence data for a list of domains. This program collects summary data for each domain in the list: country and state information parsed from pages such as "Contact" pages, Google AdSense IDs, page titles, the charset, and a word vector of the most frequent words on the domain's home page (excluding stop words).
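
As a rough illustration of the page-title and word-vector steps described above, here is a minimal Python sketch. It is not the program's actual code: the names `summarize_page`, `_TextExtractor`, and `STOP_WORDS` are hypothetical, the stop-word list is deliberately tiny, and the tokenizer assumes plain ASCII English text.

```python
import re
from collections import Counter
from html.parser import HTMLParser

# Hypothetical, deliberately small stop-word list (an assumption for this sketch).
STOP_WORDS = {"a", "an", "and", "the", "of", "to", "in", "is", "for", "on", "with"}


class _TextExtractor(HTMLParser):
    """Collects body text and the <title>, skipping <script> and <style> content."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self.title_parts = []
        self._skip_depth = 0
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1
        elif tag == "title":
            self._in_title = True

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1
        elif tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._skip_depth:
            return  # ignore JavaScript and CSS source
        if self._in_title:
            self.title_parts.append(data)
        else:
            self.chunks.append(data)


def summarize_page(html, top_n=5):
    """Return the page title and the top_n most frequent non-stop words."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    text = " ".join(parser.chunks).lower()
    # Simplistic tokenizer: runs of ASCII letters only.
    words = [w for w in re.findall(r"[a-z]+", text) if w not in STOP_WORDS]
    return {
        "title": "".join(parser.title_parts).strip(),
        "word_vector": Counter(words).most_common(top_n),
    }
```

Calling `summarize_page(html)` on a home page's HTML returns a dictionary with the title string and a list of `(word, count)` pairs. A fuller tool would also need to fetch the page, detect the charset, and handle non-English text, none of which this sketch attempts.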