Package | Description |
---|---|
org.jsoup |
Contains the main
Jsoup class, which provides convenient static access to the jsoup functionality. |
org.jsoup.helper | |
org.jsoup.nodes |
HTML document structure nodes.
|
org.jsoup.parser |
Contains the HTML parser, tag specifications, and HTML tokeniser.
|
org.jsoup.safety |
Contains the jsoup HTML cleaner, and whitelist definitions.
|
Modifier and Type | Method and Description |
---|---|
Document |
Connection.get()
Execute the request as a GET, and parse the result.
|
Document |
Connection.Response.parse()
Read and parse the body of the response as a Document.
|
static Document |
Jsoup.parse(java.io.File in,
java.lang.String charsetName)
Parse the contents of a file as HTML.
|
static Document |
Jsoup.parse(java.io.File in,
java.lang.String charsetName,
java.lang.String baseUri)
Parse the contents of a file as HTML.
|
static Document |
Jsoup.parse(java.io.InputStream in,
java.lang.String charsetName,
java.lang.String baseUri)
Read an input stream, and parse it to a Document.
|
static Document |
Jsoup.parse(java.io.InputStream in,
java.lang.String charsetName,
java.lang.String baseUri,
Parser parser)
Read an input stream, and parse it to a Document.
|
static Document |
Jsoup.parse(java.lang.String html)
Parse HTML into a Document.
|
static Document |
Jsoup.parse(java.lang.String html,
java.lang.String baseUri)
Parse HTML into a Document.
|
static Document |
Jsoup.parse(java.lang.String html,
java.lang.String baseUri,
Parser parser)
Parse HTML into a Document, using the provided Parser.
|
static Document |
Jsoup.parse(java.net.URL url,
int timeoutMillis)
Fetch a URL, and parse it as HTML.
|
static Document |
Jsoup.parseBodyFragment(java.lang.String bodyHtml)
Parse a fragment of HTML, with the assumption that it forms the
body of the HTML. |
static Document |
Jsoup.parseBodyFragment(java.lang.String bodyHtml,
java.lang.String baseUri)
Parse a fragment of HTML, with the assumption that it forms the
body of the HTML. |
Document |
Connection.post()
Execute the request as a POST, and parse the result.
|
Modifier and Type | Method and Description |
---|---|
Document |
HttpConnection.get() |
static Document |
DataUtil.load(java.io.File in,
java.lang.String charsetName,
java.lang.String baseUri)
Loads a file to a Document.
|
static Document |
DataUtil.load(java.io.InputStream in,
java.lang.String charsetName,
java.lang.String baseUri)
Parses a Document from an input steam.
|
static Document |
DataUtil.load(java.io.InputStream in,
java.lang.String charsetName,
java.lang.String baseUri,
Parser parser)
Parses a Document from an input steam, using the provided Parser.
|
Document |
HttpConnection.Response.parse() |
(package private) static Document |
DataUtil.parseInputStream(java.io.InputStream input,
java.lang.String charsetName,
java.lang.String baseUri,
Parser parser) |
Document |
HttpConnection.post() |
Modifier and Type | Method and Description |
---|---|
void |
W3CDom.convert(Document in,
org.w3c.dom.Document out)
Converts a jsoup document into the provided W3C Document.
|
org.w3c.dom.Document |
W3CDom.fromJsoup(Document in)
Convert a jsoup Document to a W3C Document.
|
Modifier and Type | Method and Description |
---|---|
Document |
Document.clone() |
static Document |
Document.createShell(java.lang.String baseUri)
Create a valid, empty shell of a document, suitable for adding more elements to.
|
Document |
Document.normalise()
Normalise the document.
|
Document |
Document.outputSettings(Document.OutputSettings outputSettings)
Set the document's output settings.
|
Document |
Node.ownerDocument()
Gets the Document associated with this Node.
|
Document |
Document.quirksMode(Document.QuirksMode quirksMode) |
Modifier and Type | Field and Description |
---|---|
protected Document |
TreeBuilder.doc |
Modifier and Type | Method and Description |
---|---|
(package private) Document |
HtmlTreeBuilder.getDocument() |
(package private) Document |
XmlTreeBuilder.parse(java.io.Reader input,
java.lang.String baseUri) |
(package private) Document |
TreeBuilder.parse(java.io.Reader input,
java.lang.String baseUri,
ParseErrorList errors,
ParseSettings settings) |
(package private) Document |
XmlTreeBuilder.parse(java.lang.String input,
java.lang.String baseUri) |
static Document |
Parser.parse(java.lang.String html,
java.lang.String baseUri)
Parse HTML into a Document.
|
static Document |
Parser.parseBodyFragment(java.lang.String bodyHtml,
java.lang.String baseUri)
Parse a fragment of HTML into the
body of a Document. |
static Document |
Parser.parseBodyFragmentRelaxed(java.lang.String bodyHtml,
java.lang.String baseUri)
|
Document |
Parser.parseInput(java.io.Reader inputHtml,
java.lang.String baseUri) |
Document |
Parser.parseInput(java.lang.String html,
java.lang.String baseUri) |
Modifier and Type | Method and Description |
---|---|
Document |
Cleaner.clean(Document dirtyDocument)
Creates a new, clean document, from the original dirty document, containing only elements allowed by the whitelist.
|
Modifier and Type | Method and Description |
---|---|
Document |
Cleaner.clean(Document dirtyDocument)
Creates a new, clean document, from the original dirty document, containing only elements allowed by the whitelist.
|
boolean |
Cleaner.isValid(Document dirtyDocument)
Determines if the input document bodyis valid, against the whitelist.
|