html() hands you the HTML back as a document object first, then you can pull
what you want from it.

If you want to parse it as a string, just use CFHTTP.
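
To make that concrete, here is a rough sketch that follows the same pattern as the manual example quoted below, applied to a plain string instead of a fetched page. The variable names (fragment, doc, links, pageDoc) are made up for illustration, and I am assuming html() treats any HTML string the same way regardless of where it came from; the select() and attr() calls are the ones already shown in the manual example.

```cfml
<cfscript>
// Hypothetical input: any string of HTML, not necessarily a whole page
fragment = '<a href="http://google.com">Google</a>';

// html() returns a jsoup Document, not a string
doc = html( fragment );

// Select every anchor that has an href attribute, as in the manual example
links = doc.select("a[href]");

// Pull the attribute value out of each matched element
for ( x = 1; x <= ArrayLen(links); x++ ){
  WriteOutput( links[x].attr("href") & "<br>" );
}
</cfscript>

<!--- Assumed alternative: fetch a page as a plain string with CFHTTP first --->
<cfhttp url="http://www.openbd.org/" method="get">
<cfscript>
// cfhttp.fileContent holds the response body as a string
pageDoc = html( cfhttp.fileContent );
</cfscript>
```

If that works the way I expect, the loop should write out the href of the one link in the string. The point is that html() only needs a string, so whether it comes from FileRead, CFHTTP, or a variable you already have should not matter.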

On Wed, Nov 16, 2011 at 2:04 PM, Jason King <[email protected]> wrote:

> doc = html( FileRead("http://www.openbd.org/") );
>
> Is the function pulling the HTML served by openbd.org and parsing it?
>
> I just want to parse a string, not an entire html document. Or, is it getting 
> the 'string' by 'fileread' so what's captured by fileread IS A STRING of html 
> characters, not any different than the string I'm providing?
>
>
> On Wed, Nov 16, 2011 at 4:01 PM, Jason King <[email protected]> wrote:
>
>> I'm looking at it now.
>>
>> Do I have to use CFSCRIPT?
>>
>>
>> On Wed, Nov 16, 2011 at 4:00 PM, Alex Skinner <[email protected]> wrote:
>>
>>> Did you have a look at the example docs I sent?
>>>
>>> The example code on the manual page would be pretty close:
>>>
>>> <cfscript>
>>> doc = html( FileRead("http://www.openbd.org/") );
>>>
>>> allLinks = doc.select("a[href]");
>>> for ( x = 1; x <= ArrayLen(allLinks); x++ ){
>>>   WriteOutput("<li>" & allLinks[x].attr("href") & " / " & 
>>> allLinks[x].text() & "</li>" );
>>> }
>>> </cfscript>
>>>
>>>
>>> Or the example on the jsoup site:
>>>
>>> http://jsoup.org/cookbook/extracting-data/attributes-text-html
>>>
>>> Alex
>>>
>>>
>>>
>>> On 16 November 2011 21:56, Jason King <[email protected]> wrote:
>>>
>>>> Ahh, I never really work with jQuery.
>>>>
>>>> How would I use it to get 'http://google.com' out of the link ref
>>>> below?
>>>>
>>>> <a href="http://google.com"></a>
>>>>
>>>> Assume
>>>>
>>>> <CFSET var = '<a href="http://google.com"></a>'>
>>>>
>>>> How would I apply html() to 'var' to strip that out?
>>>>
>>>>
>>>>
>>>> On Wed, Nov 16, 2011 at 3:48 PM, Alex Skinner <[email protected]> wrote:
>>>>
>>>>> You've just been given a document object, so now you can run
>>>>> jQuery-like selectors on it.
>>>>>
>>>>> See these for some samples:
>>>>>
>>>>> http://www.openbluedragon.org/manual/?/function/html
>>>>>
>>>>> http://jsoup.org/
>>>>>
>>>>> Alex
>>>>>
>>>>> On 16 November 2011 21:44, Jason King <[email protected]> wrote:
>>>>>
>>>>>> Upgraded....
>>>>>>
>>>>>> I have a string called 'body' and I'm applying it to the function.
>>>>>>
>>>>>> <cfset demovarA = "#html(body)#">
>>>>>>
>>>>>> What's going on here? Debugging is giving me this entire listing as
>>>>>> the value of demovarA:
>>>>>>
>>>>>> *object of org.jsoup.nodes.Document*
>>>>>> Methods
>>>>>> clone (returns org.jsoup.nodes.Node)
>>>>>> clone (returns org.jsoup.nodes.Element)
>>>>>> clone (returns org.jsoup.nodes.Document)
>>>>>> clone (returns java.lang.Object)
>>>>>> head (returns org.jsoup.nodes.Element)
>>>>>> body (returns org.jsoup.nodes.Element)
>>>>>> text (returns org.jsoup.nodes.Element)
>>>>>> nodeName (returns java.lang.String)
>>>>>> createElement (returns org.jsoup.nodes.Element)
>>>>>> normalise (returns org.jsoup.nodes.Document)
>>>>>> title (returns void)
>>>>>> title (returns java.lang.String)
>>>>>> createShell (returns org.jsoup.nodes.Document)
>>>>>> quirksMode (returns org.jsoup.nodes.Document)
>>>>>> quirksMode (returns org.jsoup.nodes.Document$QuirksMode)
>>>>>> outerHtml (returns java.lang.String)
>>>>>> outputSettings (returns org.jsoup.nodes.Document$OutputSettings)
>>>>>> parent (returns org.jsoup.nodes.Element)
>>>>>> parent (returns org.jsoup.nodes.Node)
>>>>>> addClass (returns org.jsoup.nodes.Element)
>>>>>> equals (returns boolean)
>>>>>> toString (returns java.lang.String)
>>>>>> append (returns org.jsoup.nodes.Element)
>>>>>> hashCode (returns int)
>>>>>> wrap (returns org.jsoup.nodes.Node)
>>>>>> wrap (returns org.jsoup.nodes.Element)
>>>>>> empty (returns org.jsoup.nodes.Element)
>>>>>> val (returns java.lang.String)
>>>>>> val (returns org.jsoup.nodes.Element)
>>>>>> after (returns org.jsoup.nodes.Node)
>>>>>> after (returns org.jsoup.nodes.Node)
>>>>>> after (returns org.jsoup.nodes.Element)
>>>>>> after (returns org.jsoup.nodes.Element)
>>>>>> before (returns org.jsoup.nodes.Element)
>>>>>> before (returns org.jsoup.nodes.Node)
>>>>>> before (returns org.jsoup.nodes.Element)
>>>>>> before (returns org.jsoup.nodes.Node)
>>>>>> id (returns java.lang.String)
>>>>>> attr (returns org.jsoup.nodes.Element)
>>>>>> attr (returns org.jsoup.nodes.Node)
>>>>>> prepend (returns org.jsoup.nodes.Element)
>>>>>> children (returns org.jsoup.select.Elements)
>>>>>> className (returns java.lang.String)
>>>>>> text (returns java.lang.String)
>>>>>> data (returns java.lang.String)
>>>>>> child (returns org.jsoup.nodes.Element)
>>>>>> appendChild (returns org.jsoup.nodes.Element)
>>>>>> select (returns org.jsoup.select.Elements)
>>>>>> html (returns java.lang.String)
>>>>>> html (returns org.jsoup.nodes.Element)
>>>>>> getElementById (returns org.jsoup.nodes.Element)
>>>>>> tagName (returns org.jsoup.nodes.Element)
>>>>>> tagName (returns java.lang.String)
>>>>>> tag (returns org.jsoup.parser.Tag)
>>>>>> dataset (returns java.util.Map)
>>>>>> getAllElements (returns org.jsoup.select.Elements)
>>>>>> classNames (returns org.jsoup.nodes.Element)
>>>>>> classNames (returns java.util.Set)
>>>>>> isBlock (returns boolean)
>>>>>> parents (returns org.jsoup.select.Elements)
>>>>>> prependChild (returns org.jsoup.nodes.Element)
>>>>>> appendElement (returns org.jsoup.nodes.Element)
>>>>>> prependElement (returns org.jsoup.nodes.Element)
>>>>>> appendText (returns org.jsoup.nodes.Element)
>>>>>> prependText (returns org.jsoup.nodes.Element)
>>>>>> siblingElements (returns org.jsoup.select.Elements)
>>>>>> nextElementSibling (returns org.jsoup.nodes.Element)
>>>>>> previousElementSibling (returns org.jsoup.nodes.Element)
>>>>>> firstElementSibling (returns org.jsoup.nodes.Element)
>>>>>> elementSiblingIndex (returns java.lang.Integer)
>>>>>> lastElementSibling (returns org.jsoup.nodes.Element)
>>>>>> getElementsByTag (returns org.jsoup.select.Elements)
>>>>>> getElementsByClass (returns org.jsoup.select.Elements)
>>>>>> getElementsByAttribute (returns org.jsoup.select.Elements)
>>>>>> getElementsByAttributeStarting (returns org.jsoup.select.Elements)
>>>>>> getElementsByAttributeValue (returns org.jsoup.select.Elements)
>>>>>> getElementsByAttributeValueNot (returns org.jsoup.select.Elements)
>>>>>> getElementsByAttributeValueStarting (returns org.jsoup.select.Elements)
>>>>>> getElementsByAttributeValueEnding (returns org.jsoup.select.Elements)
>>>>>> getElementsByAttributeValueContaining (returns org.jsoup.select.Elements)
>>>>>> getElementsByAttributeValueMatching (returns org.jsoup.select.Elements)
>>>>>> getElementsByAttributeValueMatching (returns org.jsoup.select.Elements)
>>>>>> getElementsByIndexLessThan (returns org.jsoup.select.Elements)
>>>>>> getElementsByIndexGreaterThan (returns org.jsoup.select.Elements)
>>>>>> getElementsByIndexEquals (returns org.jsoup.select.Elements)
>>>>>> getElementsContainingText (returns org.jsoup.select.Elements)
>>>>>> getElementsContainingOwnText (returns org.jsoup.select.Elements)
>>>>>> getElementsMatchingText (returns org.jsoup.select.Elements)
>>>>>> getElementsMatchingText (returns org.jsoup.select.Elements)
>>>>>> getElementsMatchingOwnText (returns org.jsoup.select.Elements)
>>>>>> getElementsMatchingOwnText (returns org.jsoup.select.Elements)
>>>>>> ownText (returns java.lang.String)
>>>>>> hasText (returns boolean)
>>>>>> hasClass (returns boolean)
>>>>>> removeClass (returns org.jsoup.nodes.Element)
>>>>>> toggleClass (returns org.jsoup.nodes.Element)
>>>>>> remove (returns void)
>>>>>> replaceWith (returns void)
>>>>>> attr (returns java.lang.String)
>>>>>> childNode (returns org.jsoup.nodes.Node)
>>>>>> attributes (returns org.jsoup.nodes.Attributes)
>>>>>> nextSibling (returns org.jsoup.nodes.Node)
>>>>>> previousSibling (returns org.jsoup.nodes.Node)
>>>>>> ownerDocument (returns org.jsoup.nodes.Document)
>>>>>> unwrap (returns org.jsoup.nodes.Node)
>>>>>> baseUri (returns java.lang.String)
>>>>>> setBaseUri (returns void)
>>>>>> childNodes (returns java.util.List)
>>>>>> absUrl (returns java.lang.String)
>>>>>> siblingIndex (returns int)
>>>>>> hasAttr (returns boolean)
>>>>>> removeAttr (returns org.jsoup.nodes.Node)
>>>>>> siblingNodes (returns java.util.List)
>>>>>> wait (returns void)
>>>>>> wait (returns void)
>>>>>> wait (returns void)
>>>>>> getClass (returns java.lang.Class)
>>>>>> notify (returns void)
>>>>>> notifyAll (returns void)
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>  --
>>>>>> official tag/function reference: http://openbd.org/manual/
>>>>>> mailing list - http://groups.google.com/group/openbd?hl=en
>>>>>>
>>>>>
>>>>>
>>>>>
>>>>> --
>>>>> Alex Skinner
>>>>> Managing Director
>>>>> Pixl8 Interactive
>>>>>
>>>>> Tel: +448452600726
>>>>> Email: [email protected]
>>>>> Web: pixl8.co.uk
>>>>>
>>>>>
>>>>
>>>>
>>>
>>>
>>>
>>>
>>
>>
>



-- 
Matthew Woodward
[email protected]
http://blog.mattwoodward.com
identi.ca / Twitter: @mpwoodward

Please do not send me proprietary file formats such as Word, PowerPoint,
etc. as attachments.
http://www.gnu.org/philosophy/no-word-attachments.html

