object ContentExtractors

An object containing HtmlExtractor instances for extracting primitive data such as text, elements or attributes, as well as more complex information such as form data. Because they perform little to no navigation through the document, they are typically combined with a CSS query that defines where in the HTML document the data to be retrieved is located.
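
As a rough illustration of that pairing, the sketch below applies two of the extractors through the library's DSL. The sample markup, the object name `ExtractorBasics` and the use of `JsoupBrowser` are assumptions made for the example; they are not part of this page.

```scala
import net.ruippeixotog.scalascraper.browser.JsoupBrowser
import net.ruippeixotog.scalascraper.dsl.DSL._
import net.ruippeixotog.scalascraper.dsl.DSL.Extract._

object ExtractorBasics {
  // Hypothetical sample markup, used only for illustration.
  val html: String =
    """<html><body>
      |  <h1 id="title">Weekly report</h1>
      |  <ul id="menu"><li>Home</li><li>Archive</li></ul>
      |</body></html>""".stripMargin

  val browser = JsoupBrowser()
  val doc = browser.parseString(html)

  // The CSS query selects where to look; the extractor decides what to return.
  val title = doc >> text("#title")               // text of the first match
  val menuItems = doc >> elementList("#menu li")  // all matched elements, as a List
  val menuLabels = menuItems.map(_.text)

  def main(args: Array[String]): Unit = {
    println(title)
    println(menuLabels)
  }
}
```

Keeping the query separate from the extractor means the same extractor can be reused against different locations in a document.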

Linear Supertypes
AnyRef, Any

Value Members

  1. final def !=(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  2. final def ##: Int
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  4. val allText: HtmlExtractor[Element, String]

    An extractor for the text in all matched elements.

  5. final def asInstanceOf[T0]: T0
    Definition Classes
    Any
  6. def attr(attr: String): HtmlExtractor[Element, String]

    An extractor for the value of an attribute of the first matched element.

    attr: the attribute name to extract
    returns: an extractor for an attribute of the first matched element.

  7. def attrs(attr: String): HtmlExtractor[Element, Iterable[String]]

    An extractor for a lazy iterable of the value of an attribute of each matched element.

    attr: the attribute name to extract
    returns: an extractor for a lazy iterable of the value of an attribute of each matched element.
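
    A minimal sketch contrasting `attr` and `attrs`, assuming the DSL form in which the extractor is applied to a CSS query. The markup and the object name `AttributeExample` are hypothetical.

    ```scala
    import net.ruippeixotog.scalascraper.browser.JsoupBrowser
    import net.ruippeixotog.scalascraper.dsl.DSL._
    import net.ruippeixotog.scalascraper.dsl.DSL.Extract._

    object AttributeExample {
      // Hypothetical sample markup, used only for illustration.
      val html: String =
        """<nav>
          |  <a href="/home">Home</a>
          |  <a href="/archive">Archive</a>
          |</nav>""".stripMargin

      val browser = JsoupBrowser()
      val doc = browser.parseString(html)

      // attr: the attribute value of the first matched element only.
      val firstHref = doc >> attr("href")("a")

      // attrs: one value per matched element. The result is lazy, so it is
      // forced into a List here before being reused.
      val allHrefs = (doc >> attrs("href")("a")).toList

      def main(args: Array[String]): Unit = {
        println(firstHref)
        println(allHrefs)
      }
    }
    ```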

  8. def clone(): AnyRef
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.CloneNotSupportedException]) @HotSpotIntrinsicCandidate() @native()
  9. val element: HtmlExtractor[Element, Element]

    An extractor for the first element matched.

  10. val elementList: HtmlExtractor[Element, List[Element]]

    An extractor for a list of the matched elements.

  11. val elements: HtmlExtractor[Element, ElementQuery[Element]]

    An extractor for an ElementQuery with the matched elements.

  12. final def eq(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  13. def equals(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef → Any
  14. val formData: HtmlExtractor[Element, Map[String, String]]

    An extractor for the form data present in the matched elements.

  15. val formDataAndAction: HtmlExtractor[Element, (Map[String, String], String)]

    An extractor for the form data present in the matched elements, together with the submission URL in the form.
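
    The sketch below shows how the two form extractors might be used together. The form markup and the object name `FormExample` are hypothetical, and the exact shape of the returned submission URL (raw attribute value or resolved URL) is not stated on this page, so the example does not rely on it.

    ```scala
    import net.ruippeixotog.scalascraper.browser.JsoupBrowser
    import net.ruippeixotog.scalascraper.dsl.DSL._
    import net.ruippeixotog.scalascraper.dsl.DSL.Extract._

    object FormExample {
      // Hypothetical sample markup, used only for illustration.
      val html: String =
        """<form id="login" action="/session" method="post">
          |  <input type="text" name="user" value="alice">
          |  <input type="hidden" name="token" value="abc123">
          |</form>""".stripMargin

      val browser = JsoupBrowser()
      val doc = browser.parseString(html)

      // Field names mapped to their current values.
      val fields = doc >> formData("#login")

      // The same kind of map, paired with the form's submission URL.
      val fieldsAndAction = doc >> formDataAndAction("#login")
      val action = fieldsAndAction._2

      def main(args: Array[String]): Unit = {
        println(fields)
        println(action)
      }
    }
    ```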

  16. final def getClass(): Class[_ <: AnyRef]
    Definition Classes
    AnyRef → Any
    Annotations
    @HotSpotIntrinsicCandidate() @native()
  17. def hashCode(): Int
    Definition Classes
    AnyRef → Any
    Annotations
    @HotSpotIntrinsicCandidate() @native()
  18. final def isInstanceOf[T0]: Boolean
    Definition Classes
    Any
  19. final def ne(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  20. final def notify(): Unit
    Definition Classes
    AnyRef
    Annotations
    @HotSpotIntrinsicCandidate() @native()
  21. final def notifyAll(): Unit
    Definition Classes
    AnyRef
    Annotations
    @HotSpotIntrinsicCandidate() @native()
  22. val pElement: PolyHtmlExtractor { type Out[E] = E }

    An extractor for the first element matched. It retains the concrete type of the elements being extracted.

  23. val pElementList: PolyHtmlExtractor { type Out[E] = List[E] }

    An extractor for a list of the matched elements. It retains the concrete type of the elements being extracted.

  24. val pElements: PolyHtmlExtractor { type Out[E <: Element] = ElementQuery[E] }

    An extractor for an ElementQuery with the matched elements. It retains the concrete type of the elements being extracted.
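
    To illustrate what retaining the concrete type can buy, the sketch below uses `pElementList` with a browser created through `JsoupBrowser.typed()`. It assumes that such a browser yields jsoup-specific document and element types whose `underlying` field exposes the wrapped jsoup `Element`. The markup and the object name `TypedExample` are hypothetical.

    ```scala
    import net.ruippeixotog.scalascraper.browser.JsoupBrowser
    import net.ruippeixotog.scalascraper.dsl.DSL._
    import net.ruippeixotog.scalascraper.dsl.DSL.Extract._

    object TypedExample {
      // Hypothetical sample markup, used only for illustration.
      val html: String =
        """<p><a href="/home">Home</a> <a href="/archive">Archive</a></p>"""

      // Assumption: typed() keeps the concrete JsoupBrowser type, so parsed
      // documents and extracted elements keep their jsoup-specific types too.
      val typedBrowser: JsoupBrowser = JsoupBrowser.typed()
      val typedDoc = typedBrowser.parseString(html)

      // With the plain elementList extractor these would be generic Elements.
      val links = typedDoc >> pElementList("a")

      // `underlying` is only available on the concrete jsoup-backed element type.
      val tagNames = links.map(_.underlying.tagName())

      def main(args: Array[String]): Unit =
        println(tagNames)
    }
    ```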

  25. final def synchronized[T0](arg0: => T0): T0
    Definition Classes
    AnyRef
  26. val table: HtmlExtractor[Element, Vector[Vector[Element]]]

    An extractor for the cells of an HTML table.

    Cells spanning multiple rows or columns are repeated in each of the positions they occupy. As such, well-formed rectangular tables always result in a Vector of Vectors with identical sizes.

    Rows in thead elements are always presented first, while rows inside tfoot elements are always at the end.
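
    A small sketch of the behaviour described above, assuming that the CSS query should select the table element itself. The markup and the object name `TableExample` are hypothetical.

    ```scala
    import net.ruippeixotog.scalascraper.browser.JsoupBrowser
    import net.ruippeixotog.scalascraper.dsl.DSL._
    import net.ruippeixotog.scalascraper.dsl.DSL.Extract._

    object TableExample {
      // Hypothetical sample markup with one rowspan and one colspan cell.
      val html: String =
        """<table id="scores">
          |  <thead><tr><th>Name</th><th>Q1</th><th>Q2</th></tr></thead>
          |  <tbody>
          |    <tr><td rowspan="2">Ann</td><td>1</td><td>2</td></tr>
          |    <tr><td colspan="2">n/a</td></tr>
          |  </tbody>
          |</table>""".stripMargin

      val browser = JsoupBrowser()
      val doc = browser.parseString(html)

      // Assumption: the query points at the <table> element.
      val cells = doc >> table("#scores")

      // Reduce each cell to its text to make the grid easy to inspect.
      val grid = cells.map(_.map(_.text))

      // For a well-formed rectangular table, all rows should share one width.
      val rowWidths = grid.map(_.size).distinct

      def main(args: Array[String]): Unit =
        grid.foreach(row => println(row.mkString(" | ")))
    }
    ```

    If the description above holds for this markup, every row of `grid` has three entries, with the spanning cells repeated in each position they cover.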

  27. val text: HtmlExtractor[Element, String]

    An extractor for the text in the first element matched.

  28. val texts: HtmlExtractor[Element, Iterable[String]]

    An extractor for a lazy iterable of the text in each element matched.
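
    The three text extractors differ only in how many matches they consider and how the result is shaped. A minimal sketch, with hypothetical markup and object name `TextExample`:

    ```scala
    import net.ruippeixotog.scalascraper.browser.JsoupBrowser
    import net.ruippeixotog.scalascraper.dsl.DSL._
    import net.ruippeixotog.scalascraper.dsl.DSL.Extract._

    object TextExample {
      // Hypothetical sample markup, used only for illustration.
      val html: String = "<ul><li>one</li><li>two</li><li>three</li></ul>"

      val browser = JsoupBrowser()
      val doc = browser.parseString(html)

      // text: the text of the first matched element only.
      val first = doc >> text("li")

      // texts: one String per matched element; lazy, so forced into a List here.
      val each = (doc >> texts("li")).toList

      // allText: a single String covering every matched element. How the
      // pieces are joined is not specified on this page.
      val combined = doc >> allText("li")

      def main(args: Array[String]): Unit = {
        println(first)
        println(each)
        println(combined)
      }
    }
    ```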

  29. def toString(): String
    Definition Classes
    AnyRef → Any
  30. final def wait(arg0: Long, arg1: Int): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.InterruptedException])
  31. final def wait(arg0: Long): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.InterruptedException]) @native()
  32. final def wait(): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.InterruptedException])

Deprecated Value Members

  1. def finalize(): Unit
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.Throwable]) @Deprecated
    Deprecated
    (Since version 9)
