openhub.net
Black Duck Software, Inc.
Open Hub
Apache Nutch
Activity Not Available
Commits: Listings
Analyzed about 1 year ago, based on code collected about 1 year ago.
Jan 16, 2023 — Jan 16, 2024
Showing page 1 of 3
NUTCH-3024 Remove flaky 'dependency check' target (#795)
Lewis John McGibbney
about 1 year ago
Merge pull request #796 from DigitalPebble/NUTCH-3025
Sebastian Nagel
about 1 year ago
Merged changes from master; improved Javadoc and exception handling
Julien Nioche
about 1 year ago
Merge branch 'NUTCH-3017', closes #793
Sebastian Nagel
about 1 year ago
[NUTCH-3017] Allow fast-urlfilter to load from HDFS/S3 and support gzipped input - use Hadoop-provided compression codecs - update description of property urlfilter.fast.file
Sebastian Nagel
about 1 year ago
Added filtering on whole string + documented config in nutch-default + fixed tests
Julien Nioche
about 1 year ago
NUTCH-3020 -- ParseSegment should check for okhttp's truncation flag (#794)
Tim Allison
about 1 year ago
NUTCH-3019 -- update Tika (#797)
Tim Allison
about 1 year ago
[NUTCH-3025] urlfilter-fast to filter based on the length of the URL
Julien Nioche
about 1 year ago
NUTCH-3014 Standardize Job names (#789)
Lewis John McGibbney
about 1 year ago
[NUTCH-3017] Allow fast-urlfilter to load from HDFS/S3 and support gzipped input
Julien Nioche
about 1 year ago
NUTCH-3015 Add more CI steps to GitHub master-build.yml (#790)
Lewis John McGibbney
about 1 year ago
NUTCH-3013 Employ commons-lang3's StopWatch to simplify timing logic (#788)
Lewis John McGibbney
about 1 year ago
NUTCH-2990 HttpRobotRulesParser to follow 5 redirects as specified by RFC 9309 (#779)
Sebastian Nagel
about 1 year ago
Merge pull request #776 from tballison/NUTCH-2959
Tim Allison
over 1 year ago
NUTCH-3012 SegmentReader when dumping with option -recode: NPE on unparsed documents - fall back to UTF-8 when stringifying the content of unparsed documents
Sebastian Nagel
over 1 year ago
update howto_upgrade_tika.txt
tallison
over 1 year ago
Working now locally and with Seb's single_node_cluster tests
tallison
over 1 year ago
Merge remote-tracking branch 'upstream/master' into NUTCH-2959
tallison
over 1 year ago
NUTCH-3011 HttpRobotRulesParser: handle HTTP 429 Too Many Requests same as server errors (HTTP 5xx)
Sebastian Nagel
over 1 year ago
NUTCH-2853 bin/nutch: remove deprecated commands solrindex, solrdedup, solrclean
Sebastian Nagel
over 1 year ago
NUTCH-2897 Do not suppress deprecated API warnings - deprecate constructor of NutchJob - remove deprecated call to Object.finalize() from Plugin.finalize()
Sebastian Nagel
over 1 year ago
NUTCH-3010 Injector: count unique number of injected URLs - add counter urls_injected_unique - improve log messages reporting the counts of injected/merged URLs
Sebastian Nagel
over 1 year ago
NUTCH-3009 Upgrade to Hadoop 3.3.6
Sebastian Nagel
over 1 year ago
NUTCH-3007 Fix impossible casts - remove code blocks (else clauses) unneeded and containing impossible casts
Sebastian Nagel
over 1 year ago
NUTCH-2852 SpotBugs: Method invokes System.exit(...) - remove all calls of System.exit(...) in methods except main(args) of various "checker" tools
Sebastian Nagel
over 1 year ago
Merge pull request #778 from tballison/NUTCH-3004
Tim Allison
over 1 year ago
NUTCH-3004 -- propagate ssl exception if message doesn't match "handshake alert..."
tballison
over 1 year ago
NUTCH-2959 -- downgrade commons-io to match the version we expect to come out with Hadoop 3.4.0.
tallison
over 1 year ago
NUTCH-2959 -- bump commons-io
tallison
over 1 year ago