get the web page source with the rendered html from javascript

asp.net c# html-agility-pack vb.net

Question

If I use this

WebClient client = new WebClient();
String htmlCode = client.DownloadString("http://test.net");

I am able to use the agility pack to scan the html and get most of the tags that I need but its missing the html that is rendered by the javascript.

My question is, how do I get the final rendered page source using c#. Is there something more to the WebClient to get the final rendered source after javascript is run?

1
5
8/30/2012 4:51:40 PM

Popular Answer

The HTML Agility Pack alone is not enough to do what you want, You need a javascript engine as well. To do that, you may want to check out something like Geckofx, which will allow you to embed a fully functional web browser into your application, and than allow you to programatically access the contents of the dom after the page has rendered.

http://code.google.com/p/geckofx/

1
8/30/2012 4:56:13 PM


Related Questions





Related

Licensed under: CC-BY-SA with attribution
Not affiliated with Stack Overflow
Licensed under: CC-BY-SA with attribution
Not affiliated with Stack Overflow