pyTweetBot.tools package

Submodules

pyTweetBot.tools.PageParser module

class pyTweetBot.tools.PageParser.PageParser(url, timeout=20)

Bases: object

A class that retrieves the text of an HTML page, given a URL.

html

The page's HTML.

raw_title

The page's raw title.

reload(url=u'')

Reload the page from a URL. The url argument is optional and defaults to an empty string.

text

The page's text.

title

The page’s title.

url

The loaded URL.
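The attributes listed above can be read together to summarize a loaded page. The following is a minimal sketch, not part of pyTweetBot: describe_page is a hypothetical helper, and it assumes that text and html are string-like values. It accepts any object exposing the documented attributes, so it can be tried without network access.

```python
def describe_page(parser):
    """Summarize a PageParser-like object using only its documented attributes.

    `describe_page` is a hypothetical helper for illustration; it is not
    part of pyTweetBot.
    """
    return {
        "url": parser.url,
        "title": parser.title,
        "raw_title": parser.raw_title,
        # Assumption: text and html are string-like, so len() applies.
        "text_chars": len(parser.text),
        "html_chars": len(parser.html),
    }


# Hypothetical usage, assuming pyTweetBot is installed and the URL is reachable:
#   from pyTweetBot.tools.PageParser import PageParser
#   print(describe_page(PageParser("http://example.com", timeout=10)))
```

Passing the parser in as an argument keeps the helper independent of how the page was fetched.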

exception pyTweetBot.tools.PageParser.PageParserRetrievalError

Bases: exceptions.Exception

exception pyTweetBot.tools.PageParser.UnknownEncoding

Bases: exceptions.Exception

Raised when the encoding is unknown.
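The documentation above does not say when these exceptions are raised. The sketch below assumes they can occur while a PageParser is being constructed; load_page is a hypothetical helper, not part of pyTweetBot, and it takes the parser class and the exception types as arguments so that the example stays self-contained.

```python
def load_page(parser_cls, url, expected_errors, timeout=20):
    """Build a parser for `url`, returning None if an expected error occurs.

    Hypothetical helper: `parser_cls` is meant to be PageParser and
    `expected_errors` a tuple such as
    (PageParserRetrievalError, UnknownEncoding).
    """
    try:
        return parser_cls(url, timeout=timeout)
    except expected_errors as error:
        # Assumption: retrieval and encoding failures surface here.
        print("Could not load %s: %r" % (url, error))
        return None


# Hypothetical usage, assuming pyTweetBot is installed:
#   from pyTweetBot.tools.PageParser import (
#       PageParser, PageParserRetrievalError, UnknownEncoding)
#   page = load_page(PageParser, "http://example.com",
#                    (PageParserRetrievalError, UnknownEncoding))
```

Errors outside expected_errors are deliberately left to propagate, so unrelated bugs are not hidden.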

pyTweetBot.tools.strings module

Module contents