"Traversing" 使用 PHP 简单 HTML DOM 解析器的未嵌套 div

"Traversing" un-nested divs with PHP Simple HTML DOM Parser

我正在尝试解析这样的 HTML 结构,使用 PHP 简单 HTML DOM 解析器(实际代码有点混乱,包含链接和介于两者之间的图像,但这些似乎不是问题):

<div class="members">

    <div class="membergroup">Admin</div>

        <div class="membername">Member 1</div>
        <div class="membername">Member 2</div>
        <div class="membername">Member 3</div>


    <div class="membergroup">Moderator</div>

        <div class="membername">Member 4</div>
        <div class="membername">Member 5</div>


    <div class="membergroup">Member</div>

        <div class="membername">Member 6</div>
        <div class="membername">Member 7</div>

</div>

我找不到一些合理的方法来列出特定组中的所有成员(我无法确定每个组中有多少成员)。某种循环可能是合适的,但我不知道如何打印第 3 组的第 1 个成员(本例中为第 6 个成员)。

有手册,但一直找不到解决办法:http://simplehtmldom.sourceforge.net/manual.htm

div 的结构方式,none 个成员实际上嵌套在 "membergroup" 个 div 中,而是直接在它们下面。

这是我目前的情况:

<?php

    require_once 'scripts/lib/simple_html_dom.php';
    $html = file_get_html('test_reference.html'); // The example shown above

    // We know the total amount of members.
    // However, we don't know who and how many belong to a particular group in advance.
    $membercount = 7;


    $e = $html->find('div.members', 0)->find('div.membergroup', 0);

    if ($e->plaintext === "Admin") {
        echo "We are inside the 'Admin' div now!<br />";

        echo "Member is called: ";

        // ... But we have to go back to the parent,
        // because 'membername' isn't inside 'membergroup' :-(
        echo $e->parent()->find('div.membername', 0)->plaintext;

        echo "<br /><br />";
    }

    // Same thing, but to the third group ('Member') now instead of 'Admin'
    $e = $html->find('div.members', 0)->find('div.membergroup', 2);

    if ($e->plaintext === "Member") {
        echo "We are inside the 'Member' div now!<br />";

        echo "Member is called: ";

        // Wrong! We want the first (0th) member of the 'Member' group.
        echo $e->parent()->find('div.membername', 0)->plaintext;

        echo "<br /><br />";
    }
?>

输出将是:

We are inside the 'Admin' div now! Member is called: Member 1

We are inside the 'Member' div now! Member is called: Member 1 // This is wrong, should be 6

找到要查找的 membergroup 后,在循环中使用以下代码行,直到 returns 另一个 membergroup 或到达最后一个元素:

$e = $e->next_sibling();

PHP Simple HTML DOM Parser - API Reference

element $e->next_sibling()

Returns the next sibling of element, or null if not found.