[PHP] Extract specific div element from page
Hey folks, I need to pull the contents inside of a specific div out of a page, and write it to a separate file. In this instance I am taking everything inside of div id=content/div tags from a wordpress blog, this will give me only the content and not the menus, or other stuff. I need to do this because the final document will be converted for viewing on a palm pilot. Is anyone aware of a simple solution to this problem, short of parsing the entire page and starting when I hit that div opening tag, and stopping when I hit the closing tag? One problem I can see with this method is that I would have to count divs inside of that div, otherwise I would end too early on. Any advice would be greatly appreciated. Peace and Love, distatica. -- - Anthony Hiscox Video Watch Group Public Site Currently Under Development Group Members Site Fully Operational -
Re: [PHP] Re: Extract specific div element from page
Oops, I accidentally sent this directly to CK, my apologies. Thank you for your replies. The reason that I didn't explore the JS route is because this will be running in the background, I didn't want to have to visit the page in any way. I went looking for an easy way to accomplish this in PHP but due to malformed HTML in some sites (not wordpress that I am aware of) it wasn't going to be so easy. Someone in ##php on irc.freenode.net pointed me to BeautifulSoup which is a Python module for scraping pages even if they have bad HTML. Within a minute I had a script that grabbed the parts I wanted, and even removed the parts I didn't (such as comments). Now I have a Python script that runs when I am going to update the docs on my Palm, it grabs the page(s), strips out the unimportant stuff, saves to a local directory, and then I have Sunrise parse that into plucker document format. Once again, thank you for the responses. On 6/15/07, Dan [EMAIL PROTECTED] wrote: Or you could just use Javascript combined with PHP, just use javascript it's something like this document.getElementById('tagId').innerHtml that will give you the html(contents) of the div tag you specify. Then just do something like document.form.value = document.getElementById('tagId').innerHtml. Basicly you're setting a hidden form element to have the value of the div, then when you submit the page, you have the content as $_POST['formYouSetTo']. You could have the JS execute on the submit button's onclick. It should be relatively easy if you look up the exact syntax of the javascript. - Daniel Anthony Hiscox [EMAIL PROTECTED] wrote in message news:[EMAIL PROTECTED] Hey folks, I need to pull the contents inside of a specific div out of a page, and write it to a separate file. In this instance I am taking everything inside of div id=content/div tags from a wordpress blog, this will give me only the content and not the menus, or other stuff. I need to do this because the final document will be converted for viewing on a palm pilot. Is anyone aware of a simple solution to this problem, short of parsing the entire page and starting when I hit that div opening tag, and stopping when I hit the closing tag? One problem I can see with this method is that I would have to count divs inside of that div, otherwise I would end too early on. Any advice would be greatly appreciated. Peace and Love, distatica. -- - Anthony Hiscox Video Watch Group Public Site Currently Under Development Group Members Site Fully Operational - -- PHP General Mailing List (http://www.php.net/) To unsubscribe, visit: http://www.php.net/unsub.php -- - Anthony Hiscox Video Watch Group Public Site Currently Under Development Group Members Site Fully Operational -
[PHP] Developer / Client Documents
First my problem, then a tiny bit of background information. I am trying to start a small (Read: Tiny) web development company from home, which will handle only a few light contracts. I have been spending some time searching Google for common documents used by developers and their clients to help layout all the goals and features the site might have. Unfortunately since I have not obtained any formal education, I am experiencing a slight fish-out-of-water syndrome, and am not even sure of the names of the documents I am seeking. I would be eternally grateful if some experienced developers would point me in the right direction; any documents related to planning the website, contracts, really anything would be considered useful. Now a little background, I am a stay at home father, with the majority of my daily duties centralized around wiping feces from the rear end of a young boy, feeding, and teaching. I don't have the time, money, nor resources to go to school for this, but am quite happy reading about web development whenever I get the opportunity (often 3-6 hours daily). I have what I believe to be a solid understanding of PHP, MySQL and Javascript basics, as well as w3c compliant HTML, and CSS. Currently I am spending some time studying the CakePHP framework. I also have a (very) basic understanding of XSS attacks, and SQL injections techniques and am working on improving this. I welcome ALL advice, even if it is not related to my initial question (Developer / Client documents) especially if the responder is, or was, a stay at home parent. However advice from anyone would be greatly welcomed, regardless of prior experiences. -- - Anthony Hiscox Video Watch Group Public Site Currently Under Development Group Members Site Fully Operational -