Rebuilding Process Information Thread
#11
On the subject of retrieving data, wouldn't it be theoretically possible to write a DOM-parsing program that automatically downloads the cached pages from Google and parses the data for you? I've never done this sort of thing before*, but if recovering everything by hand is really going to take ages, it might be worth me or one of the staff looking into it.

*While I haven't written anything that scrapes pages automatically, I once started (but never finished) a DOM-parsing program in PHP to clean up saved pages from a forum, archive them neatly, and fix the broken CSS.
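
For what it's worth, here's a rough sketch of what the parsing half could look like. It only covers pulling post text out of a page that has already been saved, not downloading anything from Google. It's in Python rather than PHP, using just the standard library's `html.parser`. The `post_body` class name and the `extract_posts` helper are my own assumptions for illustration; the real markup of the cached pages would need checking first.

```python
from html.parser import HTMLParser


class PostExtractor(HTMLParser):
    """Collect the text inside every <div class="post_body">.

    "post_body" is a hypothetical class name, not taken from the real pages.
    """

    def __init__(self):
        super().__init__()
        self.posts = []    # finished post texts, in page order
        self._depth = 0    # <div> nesting depth inside the current post (0 = outside)
        self._chunks = []  # text fragments of the post being read

    def handle_starttag(self, tag, attrs):
        if tag != "div":
            return
        if self._depth:
            # A nested <div> inside a post: track it so we find the right closing tag.
            self._depth += 1
        elif "post_body" in (dict(attrs).get("class") or "").split():
            self._depth = 1
            self._chunks = []

    def handle_endtag(self, tag):
        if tag == "div" and self._depth:
            self._depth -= 1
            if self._depth == 0:
                # Join the fragments and collapse runs of whitespace.
                self.posts.append(" ".join("".join(self._chunks).split()))

    def handle_data(self, data):
        if self._depth:
            self._chunks.append(data)


def extract_posts(html):
    """Return the text of each post found in one saved page."""
    parser = PostExtractor()
    parser.feed(html)
    parser.close()
    return parser.posts
```

You'd read a saved file into a string and pass it to `extract_posts`, then write the results wherever they need to go. Real-world HTML with unclosed tags would probably want a more forgiving parser than this.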


And a chat with Dazz tells me that Google rejects crawling-like activity. Oh well '_;


Messages In This Thread
Rebuilding Process Information Thread - by Dazz - 01-07-2014, 11:44 AM
RE: Rebuilding Process Information Thread - by Phaze - 01-07-2014, 02:18 PM
