package scraper

Type Members

  1. trait HtmlExtractor[-E <: Element, +A] extends (ElementQuery[E]) => A

    An object able to extract content from net.ruippeixotog.scalascraper.model.ElementQuery instances.

    E    the type of the elements needed by this HtmlExtractor
    A    the type of the extracted content
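
The variance annotations in the signature above can be illustrated with a minimal, self-contained sketch. Everything inside `ExtractorSketch` (including the simplified `Element`, `Node` and `ElementQuery`) is a hypothetical stand-in written for this example, not the library's actual definitions; only the shape of `HtmlExtractor[-E <: Element, +A]` is taken from the listing.

```scala
object ExtractorSketch {
  // Stand-in types for illustration only; the real ones live in
  // net.ruippeixotog.scalascraper.model and are richer than this.
  trait Element { def text: String }
  final case class Node(text: String) extends Element
  final case class ElementQuery[+E <: Element](elems: List[E])

  // Same shape as the documented signature: contravariant in E, covariant in A,
  // and usable as a plain function from a query to the extracted content.
  trait HtmlExtractor[-E <: Element, +A] extends (ElementQuery[E] => A)

  // An extractor that only needs the generic Element interface.
  val firstText: HtmlExtractor[Element, String] =
    new HtmlExtractor[Element, String] {
      def apply(q: ElementQuery[Element]): String =
        q.elems.headOption.map(_.text).getOrElse("")
    }

  // Contravariance in E: it is accepted where an extractor over Node is expected.
  val forNodes: HtmlExtractor[Node, String] = firstText

  // Because the trait extends a function type, it is applied like a function.
  val result: String = forNodes(ElementQuery(List(Node("hello"), Node("world"))))
}
```

Under these assumptions, an extractor written against the most general element type can be reused for any more specific element type without conversion.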

  2. trait HtmlExtractorInstances extends AnyRef
  3. trait HtmlValidator[-E <: Element, +R] extends AnyRef
  4. trait PolyHtmlExtractor extends AnyRef

    An extractor like HtmlExtractor but whose extracted content type depends on the type of the input net.ruippeixotog.scalascraper.model.Elements. A PolyHtmlExtractor supports application of CSS queries and can be turned into a normal HtmlExtractor by calling its apply[E] method, fixing the type of the input Element as E.
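
One way to picture an output type that depends on the input element type is a type member parameterised by E. The sketch below is a simplified assumption about that idea, not the library's source: `PolyExtractor`, `AllElements`, and the stand-in `Element`, `Node` and `ElementQuery` types are all hypothetical names introduced here.

```scala
object PolySketch {
  // Stand-in model types for illustration only.
  trait Element { def text: String }
  final case class Node(text: String) extends Element
  final case class ElementQuery[+E <: Element](elems: List[E])

  // Hypothetical polymorphic extractor: the result type Out[E] varies with E,
  // and apply[E] fixes E, yielding an ordinary function over ElementQuery[E].
  trait PolyExtractor {
    type Out[E <: Element]
    def apply[E <: Element]: ElementQuery[E] => Out[E]
  }

  // Returns the matched elements themselves, so Out[E] is List[E].
  object AllElements extends PolyExtractor {
    type Out[E <: Element] = List[E]
    def apply[E <: Element]: ElementQuery[E] => List[E] = _.elems
  }

  // Fixing E = Node gives back a List[Node], not a List[Element].
  val nodeExtractor: ElementQuery[Node] => List[Node] = AllElements.apply[Node]
  val nodes: List[Node] = nodeExtractor(ElementQuery(List(Node("a"), Node("b"))))
}
```

The point of the design, as far as this sketch goes, is that callers keep the precise element type in the result instead of losing it to the common supertype.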

Value Members

  1. object ContentExtractors

    An object containing HtmlExtractor instances for extracting primitive data such as text, elements or attributes, as well as more complex information such as form data. Because they perform little to no navigation through the document, they are typically preceded by a CSS query defining the location in the HTML document of the data to be retrieved.

  2. object ContentParsers

    An object containing functions for parsing extracted content. They can be used together with the DSL extractor method or by calling map on an HtmlExtractor with them.
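
The idea of applying a parser by calling map on an extractor can be sketched in plain Scala. This is a toy model under stated assumptions: `Extractor`, `rawText` and the `asInt` function below are hypothetical stand-ins defined only for this example (an extractor is reduced to a function over a raw string), and they are not the members of ContentParsers or HtmlExtractor.

```scala
object ParserSketch {
  // Hypothetical stand-in: an extractor reduced to a function from raw input to A.
  final case class Extractor[A](run: String => A) {
    // map post-processes the extracted value, leaving the extraction step unchanged.
    def map[B](f: A => B): Extractor[B] = Extractor[B](run.andThen(f))
  }

  // Extraction step: pull out the text and trim surrounding whitespace.
  val rawText: Extractor[String] = Extractor[String](_.trim)

  // Parsing step: a plain function from the extracted String to a typed value.
  // The name is illustrative, not taken from the library.
  val asInt: String => Int = _.toInt

  // Composition: extraction first, then parsing.
  val count: Extractor[Int] = rawText.map(asInt)

  val parsed: Int = count.run(" 42 ")
}
```

Keeping parsers as ordinary functions means the same parser can be reused with any extractor that yields a String.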

  3. object HtmlExtractor extends HtmlExtractorInstances

    The companion object for HtmlExtractor, containing methods for creating new extractors.

  4. object HtmlValidator
  5. object PolyHtmlExtractor
