openhub.net
Black Duck Software, Inc.
Open Hub
Follow @
OH
Sign In
Join Now
Projects
People
Organizations
Tools
Blog
BDSA
Projects
People
Projects
Organizations
Forums
H
heroshi
Settings
|
Report Duplicate
0
I Use This!
×
Login Required
Log in to Open Hub
Remember Me
Activity Not Available
Commits
: Listings
Analyzed
about 1 year
ago. based on code collected
about 1 year
ago.
Jan 20, 2023 — Jan 20, 2024
Showing page 8 of 9
Search / Filter on:
Commit Message
Contributor
Files Modified
Lines Added
Lines Removed
Code Location
Date
extracted HEROSHI_VERSION and REAL_USER_AGENT from heroshi/__init__.py to settings
Sergey Shepelev
More...
almost 15 years ago
renamed heroshi.worker.worker to heroshi.worker.Crawler because it honestly contains only Crawler class
Sergey Shepelev
More...
almost 15 years ago
README: added repo URL
Sergey Shepelev
More...
almost 15 years ago
moved configs to separate `etc` directory
Sergey Shepelev
More...
almost 15 years ago
fixed imports in worker.tests, but the tests are still NOT fixed
Sergey Shepelev
More...
almost 15 years ago
cosmetic: annotate unused options in cli_append
Sergey Shepelev
More...
almost 15 years ago
cosmetic: changed format of shared.error.Error str(), unicode() and repr()
Sergey Shepelev
More...
almost 15 years ago
cosmetic: removed unused BIND_PORT from shared package
Sergey Shepelev
More...
almost 15 years ago
`api.report_results` now accepts a single item and is thus renamed to `api.report_result`
Sergey Shepelev
More...
almost 15 years ago
cosmetic: using package-relative imports
Sergey Shepelev
More...
almost 15 years ago
cosmetic: removed unused random_useragent()
Sergey Shepelev
More...
almost 15 years ago
`len() == 0` instead of `len() is 0`
Sergey Shepelev
More...
almost 15 years ago
added contact info into REAL_USER_AGENT
Sergey Shepelev
More...
almost 15 years ago
cosmetic: removed unused keys from config
Sergey Shepelev
More...
almost 15 years ago
cosmetic: imports sorted
Sergey Shepelev
More...
almost 15 years ago
worker: unicode a bit of logging. There were unicode errors while logging. This fix should aid those errors.
Sergey Shepelev
More...
almost 15 years ago
worker: reraising KeyboardInterrupt so worker gets stopped even if timing was so exception raised inside conn.request or page.parse
Sergey Shepelev
More...
almost 15 years ago
worker: extracted setting report['visited'] to one place
Sergey Shepelev
More...
almost 15 years ago
manager: using new-random view to get random urls across all dataset
Sergey Shepelev
More...
almost 15 years ago
fix: worker: was always passing max_queue_size to get_crawl_queue instead of only remainder to-become-full
Sergey Shepelev
More...
almost 15 years ago
worker: extracted full queue pause value to config
Sergey Shepelev
More...
almost 15 years ago
worker: added socket timeout to crawling
Sergey Shepelev
More...
almost 15 years ago
manager: removed the local in-memory NEW_URLS queue
Sergey Shepelev
More...
almost 15 years ago
manager: removed document 'given' sharing lock
Sergey Shepelev
More...
almost 15 years ago
worker: reporting mechanics changed to report each URL just after it was crawled, reports buffer removed
Sergey Shepelev
More...
almost 15 years ago
shared.api uses Factory pool of Http() to cache connections to manager
Sergey Shepelev
More...
almost 15 years ago
data: FactoryPool is also exported
Sergey Shepelev
More...
almost 15 years ago
cosmetic: removed unused imports
Sergey Shepelev
More...
almost 15 years ago
CouchDB queue view renamed from 'not-given' to 'new'
Sergey Shepelev
More...
almost 15 years ago
fix: wrong use of class attributes
Sergey Shepelev
More...
almost 15 years ago
←
1
2
3
4
5
6
7
8
9
→
This site uses cookies to give you the best possible experience. By using the site, you consent to our use of cookies. For more information, please see our
Privacy Policy
Agree