NOTE: Please see the update HERE and HERE!
...When reading Scott Chamberlain's last post about web scraping, I felt it was time to pick up and complete an idea that I had been brooding over for some time:
When a scientist sets out on a new project, the first thing to do is to check whether other people have already answered the very questions he or she is about to work on. For example, I wanted to know whether any research had been done on amphibian diversity at regional/geographical scales in relation to environmental/landscape parameters. Usually I would go to Google Scholar and search for something like - intitle:amphibians AND intitle:richness OR intitle:diversity AND environment OR landscape - and then browse through the results. But this is often tedious, and a way to examine the results quickly and visually would be of great benefit.
1 Nov 2011
23 Oct 2011
A Little Webscraping-Exercise...
In R it's quite easy to pull content out of a webpage, and I'll show a little exercise in doing so.
Here I retrieve all blog addresses from R-bloggers with the function readLines() and some subsequent string processing.
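To give a rough idea of what such an exercise can look like, here is a minimal sketch, not the original code of the post. It assumes the page source has already been read into a character vector (as readLines() would return it) and that the addresses of interest sit in href attributes. The helper name extract_links and the sample HTML are made up for illustration.

```r
# Hypothetical helper: pull link targets (href values) out of raw HTML lines.
extract_links <- function(html_lines) {
  # Keep only the lines that contain at least one href attribute
  hits <- grep("href=\"", html_lines, value = TRUE, fixed = TRUE)
  # Split each line at 'href="'; every piece after the first starts with a URL
  pieces <- unlist(lapply(strsplit(hits, "href=\"", fixed = TRUE),
                          function(x) x[-1]))
  # Drop everything from the closing quote onwards, leaving the bare URL
  urls <- sub("\".*$", "", pieces)
  unique(urls)
}

# Made-up sample input standing in for the result of readLines() on a real page
sample_html <- c(
  "<ul>",
  "<li><a href=\"http://example.org/blog-a\">Blog A</a></li>",
  "<li><a href=\"http://example.org/blog-b\">Blog B</a> <a href=\"http://example.org/blog-a\">again</a></li>",
  "</ul>"
)

links <- extract_links(sample_html)
print(links)

# On a live page the input would come from something like:
# html_lines <- readLines("http://www.r-bloggers.com", warn = FALSE)
```

On a real page the extracted addresses would still need filtering, for instance with another grep() call, to keep only the links that actually point to blogs.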
Labels: grep(), HTML, R, readLines(), String-Manipulation, strsplit(), sub(), Web Scraping

