У меня есть следующий HTML-код
<ol>
<li>If someone is <b>able</b> to do something, they <a href="/wiki/can" title="can">can</a> do it.
<dl>
<dd><i>I'm busy today, so I won't be <b>able</b> to see you.</i></dd>
</dl>
</li>
</ol>
Как я могу извлечь текст между тегами <li>
и <dl>
.
Я пробовал это:
from bs4 import BeautifulSoup
s = """<ol>
<li>If someone is <b>able</b> to do something, they <a href="/wiki/can" title="can">can</a> do it.
<dl>
<dd><i>I'm busy today, so I won't be <b>able</b> to see you.</i></dd>
</dl>
</li>
</ol>
"""
soup = BeautifulSoup(s)
for line in soup.find_all('ol'):
print line.li.get_text()
Это напечатает
If someone is able to do something, they can do it.
I'm busy today, so I won't be able to see you.
Мне нужна только первая строка.
If someone is able to do something, they can do it.