C# data scraping from websites

c# html-agility-pack

Question

HI I am somewhat fresh to the C# world. been using JavaScript and PHP since the start of this year. I wish to delete blog entries and comments. Its URL is http://www.somewhereinblog.net.

I want to accomplish the following: 1. I want to use software to log in. Secondly, download the HTML. 3. Then, to distinguish between the contents of posts and comments, use regular expressions, xpath, or whatever else is available.

I've been looking everywhere. Very little was understood. Nevertheless, I am certain that I must utilize "htmlagilitypack." I am unable to add a library to a C# form or console program. Can someone please assist me? I really need this. And after only a week, I'm not really into C#. Therefore, I would appreciate some specific details. waiting impatiently

We appreciate it, brothers.

1
1
10/26/2012 1:22:05 PM

Accepted Answer

  1. Zzz-7-Zzz allows you to log in and download.
  2. as opposed to html-agility-pack I like CsQuery because it enables the usage of jQuery syntax within a string in C# code. This enables you to download html to a string and perform operations on it just as you would with a jQuery-enabled HTML page.
5
9/21/2012 5:40:28 AM


Related Questions





Related

Licensed under: CC-BY-SA with attribution
Not affiliated with Stack Overflow
Licensed under: CC-BY-SA with attribution
Not affiliated with Stack Overflow