嗨,我正在使用此代码从ebay获取一些数据。
string url = textBox1.Text;
get_all_link(url);
private void get_all_link(string url)
{
var webGet = new HtmlWeb();
try
{
var document = webGet.Load(url);
MessageBox.Show(url);
var metaTags = document.DocumentNode.SelectNodes("//div[@class='ttl']/a");
if (metaTags != null)
{
foreach (var tag in metaTags)
{
string link = tag.Attributes["href"].Value;
links.Add(link);
}
}
var next = document.DocumentNode.SelectNodes("//td[@class='botpg-next']/a");
if (next != null)
{
string link = "http://www.ebay.com" + next;
get_all_link(link);
}
else return;
}
catch (Exception f)
{
MessageBox.Show(f.ToString());
TextWriter tw = new StreamWriter("data.txt");
tw.WriteLine(f.ToString());
tw.Close();
}
}
这是要解析的链接: http ://www.ebay.com/sch/i.html?_nkw = gruen + -sara + -quartz + -embassy + -bob + -robert + -elephants + -adidas&__sacat = 0& LL_Auction = 1& _dmpt = Watswatches&_odkw = gruen-- sara + -quartz + -embassy + -bob + -robert + -elephants + -adidas&_osacat = 1&_trksid = p3286.c0.m270.l1313并且它在文件中写入以下异常
System.Net.WebException: The remote name could not be resolved: 'www.ebay.comhtmlagilitypack.htmlnodecollection' at System.Net.HttpWebRequest.GetResponse() at HtmlAgilityPack.HtmlWeb.Get(Uri uri, String method, String path, HtmlDocument doc, IWebProxy proxy, ICredentials creds) in C:\Source\htmlagilitypack\Trunk\HtmlAgilityPack\HtmlWeb.cs:line 1446 at HtmlAgilityPack.HtmlWeb.LoadUrl(Uri uri, String method, WebProxy proxy, NetworkCredential creds) in C:\Source\htmlagilitypack\Trunk\HtmlAgilityPack\HtmlWeb.cs:line 1563 at HtmlAgilityPack.HtmlWeb.Load(String url, String method) in C:\Source\htmlagilitypack\Trunk\HtmlAgilityPack\HtmlWeb.cs:line 1152 at HtmlAgilityPack.HtmlWeb.Load(String url) in C:\Source\htmlagilitypack\Trunk\HtmlAgilityPack\HtmlWeb.cs:line 1107 at BackgroundWorker.Form1.get_all_link(String url) in C:\Documents and Settings\maruf\My Documents\Visual Studio 2008\Projects\BackgroundWorker\BackgroundWorker\Form1.cs:line 86
我在这里查了一些帖子。但无法解决这个问题。我怎么解决这个?有什么建议??提前致谢 :)
var next = document.DocumentNode.SelectNodes("//td[@class='botpg-next']/a");
返回node
对象而不是string
。你需要使用这个对象的一个属性,可能是像href
或link
这样的东西,并相应地使用它。