Obtenir une cellule de table imbriquée avec HtmlAgilityPack

c# html-agility-pack html-parsing

Question

J'ai la page HTML suivante:

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<html>

<head>
</head>

<body>
<div style="LEFT: 135; WIDTH: 450; POSITION: absolute; TOP: 70">
  <a name="content_begins"></a>
  <table cellSpacing="0" cellPadding="0" width="100%" border="0">
    <tr>
      <td vAlign="top">
      <table class="PortalGadget" cellSpacing="0" cellPadding="0" width="100%" border="0">
        <tr>
          <td>
          <table cellSpacing="6" cellPadding="0" border="0">
            <form name="authenticator" ACTION="auth-cup" METHOD="POST" ENCTYPE="application/x-www-form-urlencoded">
              <tr bgColor="#6c8899">
                <td><font face="Trebuchet MS" color="#ffffff">&nbsp;*** You Have Already Logged In ***</font></td>
              </tr>

              <tr height="20">
                <td vAlign="bottom" height="20">
                <font face="Trebuchet MS" size="-1">Context:</font></td>
              </tr>
              <tr>
                <td></td>
              </tr>
              <tr>
                <td vAlign="bottom" height="20">
                <font face="Trebuchet MS" size="-1">Username:</font></td>
              </tr>
              <tr>
                <td><input size="40" name="username"></td>
              </tr>
              <tr>
                <td><font face="Trebuchet MS" size="-1">Password:</font></td>
              </tr>
              <tr>
                <td><input type="password" size="40" name="password"></td>
              </tr>
              <tr>
                <td vAlign="bottom" height="20">
                <font face="Trebuchet MS" size="-1">Destination:</font></td>
              </tr>
              <tr>
                <td>
                <input size="40" name="url" value="http://ya.ru/"></td>
              </tr>
              <tr>
                <!---- Hidden field to identify forward/reverse proxy path ------>
                <input TYPE="hidden" NAME="proxypath" VALUE="forward">
              </tr>
              <tr>
                <td vAlign="bottom" align="left">
                <input type="submit" alt="Login" value="Login" border="0" name="loginButton">&nbsp;&nbsp;
                <input type="reset" alt="Reset" value="Reset"></td>
              </tr>
              <tr height="5">
                <td height="5"><hr SIZE="2"></td>
              </tr>
              <tr>
                <td><font face="Trebuchet MS" size="-1">
                <p>Copyright 1999-2003 Novell, Inc. All rights reserved.</p>
                </font></td>
              </tr>
            </form>
            </tbody>
          </table>
          </td>
        </tr>
      </table>
      </td>
    </tr>
  </table>
</div>
</body>
</html>

Je veux obtenir le texte "Vous êtes déjà connecté" de td. J'ai ce code:

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<html>

<head>
</head>

<body>
<div style="LEFT: 135; WIDTH: 450; POSITION: absolute; TOP: 70">
  <a name="content_begins"></a>
  <table cellSpacing="0" cellPadding="0" width="100%" border="0">
    <tr>
      <td vAlign="top">
      <table class="PortalGadget" cellSpacing="0" cellPadding="0" width="100%" border="0">
        <tr>
          <td>
          <table cellSpacing="6" cellPadding="0" border="0">
            <form name="authenticator" ACTION="auth-cup" METHOD="POST" ENCTYPE="application/x-www-form-urlencoded">
              <tr bgColor="#6c8899">
                <td><font face="Trebuchet MS" color="#ffffff">&nbsp;*** You Have Already Logged In ***</font></td>
              </tr>

              <tr height="20">
                <td vAlign="bottom" height="20">
                <font face="Trebuchet MS" size="-1">Context:</font></td>
              </tr>
              <tr>
                <td></td>
              </tr>
              <tr>
                <td vAlign="bottom" height="20">
                <font face="Trebuchet MS" size="-1">Username:</font></td>
              </tr>
              <tr>
                <td><input size="40" name="username"></td>
              </tr>
              <tr>
                <td><font face="Trebuchet MS" size="-1">Password:</font></td>
              </tr>
              <tr>
                <td><input type="password" size="40" name="password"></td>
              </tr>
              <tr>
                <td vAlign="bottom" height="20">
                <font face="Trebuchet MS" size="-1">Destination:</font></td>
              </tr>
              <tr>
                <td>
                <input size="40" name="url" value="http://ya.ru/"></td>
              </tr>
              <tr>
                <!---- Hidden field to identify forward/reverse proxy path ------>
                <input TYPE="hidden" NAME="proxypath" VALUE="forward">
              </tr>
              <tr>
                <td vAlign="bottom" align="left">
                <input type="submit" alt="Login" value="Login" border="0" name="loginButton">&nbsp;&nbsp;
                <input type="reset" alt="Reset" value="Reset"></td>
              </tr>
              <tr height="5">
                <td height="5"><hr SIZE="2"></td>
              </tr>
              <tr>
                <td><font face="Trebuchet MS" size="-1">
                <p>Copyright 1999-2003 Novell, Inc. All rights reserved.</p>
                </font></td>
              </tr>
            </form>
            </tbody>
          </table>
          </td>
        </tr>
      </table>
      </td>
    </tr>
  </table>
</div>
</body>
</html>

Mais je ne peux pas obtenir le texte "Vous êtes déjà connecté" en code. Comment obtenir des valeurs de cellules de tableau imbriquées dans ce scénario?

Réponse acceptée

Hmmm j'ai trouvé la solution me semble:

string value = doc.DocumentNode.SelectNodes("//table[@class='PortalGadget']/tr/td/table/tr/td")[0].InnerText;

Réponse populaire

Pour sélectionner des nœuds à partir d'un autre nœud, vous devez mettre un '.' au début du XPath.

Ainsi, table.SelectNodes("//table") devient table.SelectNodes(".//table")




Sous licence: CC-BY-SA with attribution
Non affilié à Stack Overflow
Est-ce KB légal? Oui, apprenez pourquoi
Sous licence: CC-BY-SA with attribution
Non affilié à Stack Overflow
Est-ce KB légal? Oui, apprenez pourquoi