This is a simple web application that extracts images from URLs and caches them. It uses Reddit's scraper code, with a couple of small tweaks, mostly to allow it to run on Google App Engine. We use it for Msgboy.
The app provides a single call, /?url=, which returns a URL corresponding to the location of the extracted image. Your app should cache that URL.
Please note that the returned URL may also be a redirect URL, which should 'soon' redirect to the actual image once it has been extracted.
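As a rough illustration of calling the service and caching the result on the client side, here is a minimal Python sketch. It is not part of this app: the host `your-app-id.appspot.com` and the helper names are hypothetical, and it assumes the service answers with the image URL as plain text in the response body, which you should check against your own instance.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

# Hypothetical host: replace with the App Engine application you deployed.
EXTRACTOR_BASE = "https://your-app-id.appspot.com/"

# Simple in-memory cache: page URL -> URL returned by the service.
_cache = {}


def build_request_url(page_url, base=EXTRACTOR_BASE):
    """Return the /?url= call for page_url, with the page URL percent-encoded."""
    return base + "?" + urlencode({"url": page_url})


def fetch_body(request_url):
    """Assumption: the service replies with the image URL as plain text."""
    with urlopen(request_url, timeout=10) as response:
        return response.read().decode("utf-8").strip()


def image_url_for(page_url, fetch=fetch_body):
    """Return the image URL for page_url, calling the service at most once."""
    if page_url not in _cache:
        _cache[page_url] = fetch(build_request_url(page_url))
    return _cache[page_url]
```

Percent-encoding the page URL matters because it usually contains its own `?` and `&` characters. The `fetch` parameter only exists so the lookup can be exercised without network access.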
- Signup or log in to Google App Engine
- Create a new application. Give it a non-guessable name if you want to be the only one using it :)
- Check out the code:
git clone git://github.com/superfeedr/image-extrator.git
- Update your application's name in app.yaml, line 1 (replace image-extrator with whatever you chose at step 2).
- Deploy your instance.
You will quickly (at around 100,000 requests/day) hit Google App Engine's 1GB limit on incoming bandwidth. When that happens, upgrade to a "paid" app. Don't worry too much, though, as incoming bandwidth is free on Google App Engine!
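To see what those two figures imply together, here is a back-of-the-envelope calculation. It assumes the 1GB limit applies per day and takes 1GB as 10**9 bytes; both are assumptions, so treat the result as an order of magnitude only.

```python
DAILY_QUOTA_BYTES = 1_000_000_000  # 1GB, taken here as 10**9 bytes (assumption)
REQUESTS_PER_DAY = 100_000         # figure quoted above

# Average incoming data the app can afford per request before hitting the limit.
bytes_per_request = DAILY_QUOTA_BYTES / REQUESTS_PER_DAY
print(f"{bytes_per_request / 1000:.0f} KB per request")
```

If the pages you send to your instance are heavier on average, expect to reach the limit at a lower request count.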
Please check our other community services: Feediscovery and Notifixlight.