1690 Commits

Author SHA1 Message Date
Eugene Livis
09014d34b6 Registering as service providers 2017-01-09 15:58:50 -05:00
Eugene Livis
e3ed9dfc34 Resolved merge conflicts 2017-01-09 14:08:14 -05:00
Eugene Livis
7e291b1124 <erge latest 2017-01-09 10:37:35 -05:00
Eugene Livis
75441946a1 Minor 2017-01-09 10:23:03 -05:00
Richard Cordovano
be7bdced90 Merge in develop branch with text extraction refactoring 2017-01-08 10:48:17 -05:00
Richard Cordovano
837eb1477f Pull in text extraction refactoring and resolve merge conflicts 2017-01-08 10:22:18 -05:00
Richard Cordovano
b0ce3168df Merge pull request #2434 from millmanorama/fix-compiler-warnings
fix compiler warnings about raw types
2017-01-07 11:10:24 -05:00
Richard Cordovano
8fbb19a67d Merge remote-tracking branch 'upstream/develop' into search_improvements 2017-01-07 10:32:34 -05:00
Richard Cordovano
5463d3a719 Remove kws public Server.getIngester, Ingester is not public 2017-01-07 10:30:58 -05:00
Eugene Livis
4299a2326e More work 2017-01-06 16:22:49 -05:00
Eugene Livis
b5e3639167 Fixing comments 2017-01-06 16:18:45 -05:00
Eugene Livis
d23b78f57c Fixing comments 2017-01-06 16:17:42 -05:00
Eugene Livis
bb0c3e55eb Fixing comments 2017-01-06 16:14:53 -05:00
Eugene Livis
21f2efbdcf More work 2017-01-06 16:05:11 -05:00
Eugene Livis
b05dded08a Got inex folder search algorithm to work 2017-01-06 15:48:44 -05:00
Eugene Livis
40cc726a11 First cut at integrating AutopsyServiceProvider 2017-01-05 17:16:27 -05:00
Eugene Livis
7d252864a4 Index folder finding algorithm seems to work 2017-01-05 13:41:12 -05:00
Eugene Livis
4555f7d44d Merge branch 'search_improvements' of https://github.com/sleuthkit/autopsy into solr65 2017-01-05 12:42:55 -05:00
Richard Cordovano
210068e241 Merge in develop branch 2017-01-05 10:24:59 -05:00
Eugene Livis
62ad3e1eb2 First cut of index search algorithm 2017-01-03 16:51:57 -05:00
Richard Cordovano
46369eff44 Update NBM versioning for 4.3.0 2017-01-02 18:45:21 -05:00
Richard Cordovano
13411450aa 4.3.0 preps: DSPs, public API restore, const name 2017-01-02 17:36:59 -05:00
millmanorama
3557f141e1 use UTF-8 encoding for ArtifactTextExtractor streams and readers 2017-01-02 16:45:51 +01:00
millmanorama
4ae0a688bc don't commit unnecessarily 2016-12-31 14:31:11 +01:00
millmanorama
8526427b4f cleanup and comment TextExtractor
cleanup and comment TextExtractor immplementations more.

remove constants left over from merge
2016-12-28 17:30:42 +01:00
millmanorama
f56c2b43c8 move all 'appendix' related code into TikaTextExtractor and simplify TextExtractor interface. 2016-12-28 17:30:32 +01:00
millmanorama
8841f6e773 minor fixes 2016-12-28 17:30:30 +01:00
millmanorama
2d5cd2efc1 comment up Ingester 2016-12-28 17:30:27 +01:00
millmanorama
c94d3de872 move encoding options to StringsTextExtractor 2016-12-28 17:30:25 +01:00
millmanorama
9b85284194 remove unused outerclasses that have copies as innerclasses 2016-12-28 17:30:23 +01:00
millmanorama
c42f687bfb more cleanup
more cleanup
2016-12-28 17:30:15 +01:00
millmanorama
b904c37dd2 remove more unneeded ContentStreams and cleanup logging 2016-12-28 15:03:45 +01:00
millmanorama
0303c96d41 cleanup Ingester.indexChunk 2016-12-28 15:03:04 +01:00
millmanorama
abf21f58ee remove obsolete and unused ContentStreams 2016-12-28 15:03:03 +01:00
millmanorama
2b4bb33798 cleanup up ArtifactExtractor; reduce use of ContentStream 2016-12-28 15:03:01 +01:00
millmanorama
697a7d7a58 reduce method overloads for indexing artifacts 2016-12-28 15:02:59 +01:00
millmanorama
b38171dbd7 make the ByteXXXStream classes inner classes of the TextExtractors that use them. 2016-12-28 15:02:58 +01:00
millmanorama
85af7c57b6 build out ArtifactExtractor 2016-12-28 15:02:56 +01:00
millmanorama
1a70a4e8b2 introduce ArtifactExtractor 2016-12-28 15:02:39 +01:00
millmanorama
359dc16ee5 inline indexChunk 2016-12-28 15:02:23 +01:00
millmanorama
c9795cabcb pull up methods from TextExtractorBase into TextExtractor.java 2016-12-28 15:02:21 +01:00
millmanorama
0f1f8b2211 refactor common chunking algorithm into TextExtractorBase, remove AbstractFileChunk 2016-12-28 15:02:18 +01:00
Richard Cordovano
a5902d50f5 Correctly handle CancellationException in KeywordSearchResultFactory.BlackboardResultWriter 2016-12-19 17:27:42 -05:00
millmanorama
094db06075 fix compiler warnings about raw types 2016-12-16 14:56:41 +01:00
esaunders
0fce991ca0 Removed unnecessary Solr artifacts from build scripts. 2016-12-14 17:11:20 -05:00
esaunders
64990065f2 Merge branch 'solr6_standalone' into 2121_regex_query 2016-12-14 16:49:20 -05:00
esaunders
bcda17746e Updated version number and commented out copying of content and file_name into content_ws. 2016-12-14 15:58:32 -05:00
esaunders
63829ba3bc Updated search runner to use RegexQuery for regular expressions. 2016-12-14 15:56:41 -05:00
esaunders
a991bf7d8e Modified regular expressions for use with new RegexQuery class. 2016-12-14 15:54:04 -05:00
esaunders
020011bff1 Change the ordering of the regex for the last element of the IP address regex because we were only getting IP address hits containing a single digit as the last element, e.g. we would get a hit for 152.163.199.5 instead of 152.163.199.56. 2016-12-14 13:58:18 -05:00