Commits : Listings

Analyzed about 1 year ago, based on code collected about 1 year ago.
Jan 20, 2023 — Jan 20, 2024
Commit Message | Date
setup fix: use find_packages to include heroshi.{data,manager.worker} subpackages in distribution. | over 14 years ago
Optionally use the `yajl` module for JSON parsing if available | over 14 years ago
fix graceful_stop when _queue_updater_thread was not started (because URLs supplied inline w/o manager) | over 14 years ago
Added distribute setup script. | over 14 years ago
cosmetic | over 14 years ago
manager, storage: eager initialization of storage connection, so manager will fail on the very start if storage is not available | over 14 years ago
manager: eagerly initialize manager (including prefetch worker, which would init storage connection) | over 14 years ago
Crawler: catch all HTTP exceptions in fetch() | over 14 years ago
tests: graceful_stop in tests tearDown | over 14 years ago
extracted common reraise_errors utility function for propagating errors up to another thread | over 14 years ago
using json module instead of cjson (works on 2.6 and with simplejson installed) | over 14 years ago
fix: missed `raise_conflict` in `storage.save_content` declaration | almost 15 years ago
Extracted logging to module `log`; more smooth initialization of logging. | almost 15 years ago
More specific configuration errors: ConfigNotFound and ConfigNotSpecified. | almost 15 years ago
manager: more clear handling of links (now used only in bin/heroshi-append) | almost 15 years ago
cosmetic | almost 15 years ago
cosmetic: catch all CrawlError-s, instead of particular two | almost 15 years ago
fix: worker.Crawler: set visited timestamp even in case of error | almost 15 years ago
cosmetic: removed unused imports | almost 15 years ago
manager: don't store links count since crawler doesn't supply links anymore | almost 15 years ago
cosmetic | almost 15 years ago
worker.Crawler doesn't do content parsing anymore | almost 15 years ago
manager: extracted postreport worker single step; also do 2 duplicate report checks | almost 15 years ago
manager: extracted given_items cache timeout to config under prefetch:cache_timeout key | almost 15 years ago
data.Cache: added popitem. More compliance with dict. | almost 15 years ago
bin/check: additional flags are passed to nose | almost 15 years ago
data.Cache: added `pop` method and fixed expiry race bug | almost 15 years ago
tests for `data.Cache` | almost 15 years ago
manager: cli_manager, separate entry-point which runs eventlet.wsgi. | almost 15 years ago
dns: rewrote dns module without class. Cache must be passed to `dns.gethostbyname` explicitly. | almost 15 years ago