scrappy 0.1

Scrappy is a tool for extracting information from web pages and producing RDF data. It uses the scraping ontology to define the mappings between HTML content and RDF data.

The following example mapping extracts all titles from http://www.elmundo.es:

    dc: http://purl.org/dc/elements/1.1/
    rdf: http://www.w3.org/1999/02/22-rdf-syntax-ns#
    sioc: http://rdfs.org/sioc/ns#
    sc: http://lab.gsi.dit.upm.es/scraping.rdf#
    *:
      rdf:type: sc:Fragment
      sc:selector:
        *:
          rdf:type: sc:UriSelector
          rdf:value: "http://www.elmundo.es/"
      sc:identifier:
        *:
          rdf:type: sc:BaseUriSelector
      sc:subfragment:
        *:
          sc:type: sioc:Post
          sc:selector:
            *:
              rdf:type: sc:CssSelector
              rdf:value: ".noticia h2, .noticia h3, .noticia h4"
          sc:identifier:
            *:
              rdf:type: sc:CssSelector
              rdf:value: "a"
              sc:attribute: "href"
          sc:subfragment:
            *:
              sc:type: rdf:Literal
              sc:relation: dc:title
              sc:selector:
                *:
                  rdf:type: sc:CssSelector
                  rdf:value: "a"

The mapping above is serialized in YARF, a format supported by the LightRDF gem. Mappings can also be defined in RDF/XML, JSON, or N-Triples.
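To make the nesting in the example mapping easier to follow, here is a minimal sketch that models the same three levels (page, post, title) as plain Ruby data and walks them depth-first. It does not use Scrappy or LightRDF: the `Fragment` struct, the `MAPPING` constant, and the `describe` helper are hypothetical names introduced only for illustration, and the hash layout is an assumption rather than how the gem stores mappings internally.

```ruby
# Hypothetical in-memory model of the mapping; not Scrappy's own API.
Fragment = Struct.new(:type, :selector, :identifier, :relation, :subfragments,
                      keyword_init: true)

MAPPING = Fragment.new(
  type: nil, # the top-level fragment has no sc:type in the example
  selector: { kind: "UriSelector", value: "http://www.elmundo.es/" },
  identifier: { kind: "BaseUriSelector" },
  subfragments: [
    Fragment.new(
      type: "sioc:Post",
      selector: { kind: "CssSelector", value: ".noticia h2, .noticia h3, .noticia h4" },
      identifier: { kind: "CssSelector", value: "a", attribute: "href" },
      subfragments: [
        Fragment.new(
          type: "rdf:Literal",
          relation: "dc:title",
          selector: { kind: "CssSelector", value: "a" },
          subfragments: []
        )
      ]
    )
  ]
)

# Walk the fragment tree depth-first, returning one indented line per fragment.
def describe(fragment, depth = 0)
  selector = fragment.selector
  line = "#{'  ' * depth}#{fragment.type || 'page'}: " \
         "#{selector[:kind]} #{selector[:value].inspect}"
  line += " -> #{fragment.relation}" if fragment.relation
  [line] + fragment.subfragments.flat_map { |sub| describe(sub, depth + 1) }
end

puts describe(MAPPING)
```

Read top to bottom, the output mirrors the mapping: the URI selector picks the page, the first CSS selector picks each headline element as a `sioc:Post`, and the innermost selector yields the link text attached through `dc:title`.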

Versions:

  1. 0.4.10 - December 01, 2011 (52 KB)
  2. 0.4.9 - November 28, 2011 (52 KB)
  3. 0.4.8 - November 24, 2011 (52 KB)
  4. 0.4.7 - November 24, 2011 (52 KB)
  5. 0.4.6 - October 05, 2011 (52 KB)
  6. 0.1 - October 07, 2010 (17 KB)
Show all versions (44 total)

Runtime dependencies (6):

activesupport >= 2.3.5
camping = 2.0
lightrdf >= 0.1
markaby >= 0.7.1
mechanize >= 1.0.0
nokogiri >= 1.4.1
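The constraints above follow the usual RubyGems requirement syntax: `>=` sets a minimum version, while `=` pins an exact one. As a rough sketch, they can be checked with `Gem::Requirement` and `Gem::Version`, which ship with Ruby. The `RUNTIME_DEPENDENCIES` hash and the `satisfies?` helper are hypothetical names, and the candidate version strings are illustrative inputs only, not a claim about which releases exist.

```ruby
require "rubygems"

# Runtime dependencies as listed for scrappy 0.1.
RUNTIME_DEPENDENCIES = {
  "activesupport" => ">= 2.3.5",
  "camping"       => "= 2.0",
  "lightrdf"      => ">= 0.1",
  "markaby"       => ">= 0.7.1",
  "mechanize"     => ">= 1.0.0",
  "nokogiri"      => ">= 1.4.1"
}.freeze

# Hypothetical helper: does the candidate version meet the listed constraint?
def satisfies?(gem_name, version)
  requirement = Gem::Requirement.new(RUNTIME_DEPENDENCIES.fetch(gem_name))
  requirement.satisfied_by?(Gem::Version.new(version))
end

puts satisfies?("camping", "2.0")      # exact pin, same version
puts satisfies?("camping", "2.1")      # exact pin, different version
puts satisfies?("nokogiri", "1.4.0")   # just below the minimum
puts satisfies?("nokogiri", "1.10.0")  # segments compare numerically, not as strings
```

Note the exact pin on camping: unlike the other five dependencies, it does not accept any newer version.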

Development dependencies (2):

hoe >= 2.6.0
rubyforge >= 2.0.4

Authors:

  • Jose Ignacio

Total downloads: 136,374

For this version: 3,315

Licenses:

N/A

Required Ruby Version: None

Links: