An easy to implement professional web scraper for WordPress. This can be used to display realtime data from any websites directly into your posts, pages or sidebar. Use this to include realtime stock quotes, cricket or soccer scores or any other generic content. The scraper is an extension of WP_HTTP class for scraping and uses phpQuery or xpath for parsing HTML. Features include:
Can be easily implemented using the button in the post / page editor.
Configurable caching of scraped data. Cache timeout in minutes can be defined in minutes for every scrap.
Configurable Useragent for your scraper can be set for every scrap.
Scrap output can be displayed thru custom template tag, shortcode in page, post and sidebar (through a text widget).
Other configurable settings like timeout, disabling shortcode etc.
Error handling - Silent fail, error display, custom error message or display expired cache.
Clear or replace a regex pattern from the scrap before output.
Option to pass post arguments to a URL to be scraped.
Dynamic conversion of scrap to specified character encoding (using incov) to scrap data from a site using different charset.
Create scrap pages on the fly using dynamic generation of URLs to scrap or post arguments based on your page's get or post arguments.
Callback function to parse the scraped data.
For demos and support, visit the WP Web Scraper project page. Comments appreciated.
Tags: curl, html, import, page, phpquery, Post, Realtime, sidebar, stock market, web scraping, xpath
Source: http://wordpress.org/plugins/wp-web-scrapper/
Can be easily implemented using the button in the post / page editor.
Configurable caching of scraped data. Cache timeout in minutes can be defined in minutes for every scrap.
Configurable Useragent for your scraper can be set for every scrap.
Scrap output can be displayed thru custom template tag, shortcode in page, post and sidebar (through a text widget).
Other configurable settings like timeout, disabling shortcode etc.
Error handling - Silent fail, error display, custom error message or display expired cache.
Clear or replace a regex pattern from the scrap before output.
Option to pass post arguments to a URL to be scraped.
Dynamic conversion of scrap to specified character encoding (using incov) to scrap data from a site using different charset.
Create scrap pages on the fly using dynamic generation of URLs to scrap or post arguments based on your page's get or post arguments.
Callback function to parse the scraped data.
For demos and support, visit the WP Web Scraper project page. Comments appreciated.
Tags: curl, html, import, page, phpquery, Post, Realtime, sidebar, stock market, web scraping, xpath
Source: http://wordpress.org/plugins/wp-web-scrapper/
Nice posting,thanks for sharing the great information with us and topics is the scraping website and i read the full blog and i recommend this blog to my friends and i definitely sure they will benefit from this blog.
ReplyDeleteYell Data Extractor