June 2015 Zillman Column – Resources for Extracting Information from the World Wide Web
The June 2015 Zillman Column features Resources for Extracting Information from the World Wide Web by Marcus P. Zillman, M.S., A.M.H.A. and is a comprehensive listing of data extraction resources currently available on the Internet. Extracting data from the World Wide Web (WWW) has become an important issue in the last few years as the number of web pages available on the visible Internet has grown to billions of pages with over trillions of pages available from the invisible web. Tools and protocols to extract all this information have now come in demand as researchers as well as web browsers and surfers want to discover new knowledge at an ever increasing rate! As robots (bots) and intelligent agents are at the heart of many extraction tools I decided to create a compilation of the latest sources and sites that extract information from the web. There are a number of eMail extraction tools still available through the Internet and I have decided not to list these as they aid to the on-going and increasing problem of SPAM except for a readily available DMOZ Directory listing. The below compilation of sources is taken from my white paper titled Web Data Extractors and is constantly updated with Subject Tracer™ bots at the following URL: http://www.WebDataExtractors.com/. Download this excellent freely available 19 page 168KB pdf column today and begin your online knowledge discovery into extracting information from the World Wide Web including excellent sources, tools, and sites!. This is another MUST have column in today’s ever changing and New Economy world!!