millmanorama
|
8841f6e773
|
minor fixes
|
2016-12-28 17:30:30 +01:00 |
|
millmanorama
|
2d5cd2efc1
|
comment up Ingester
|
2016-12-28 17:30:27 +01:00 |
|
millmanorama
|
c94d3de872
|
move encoding options to StringsTextExtractor
|
2016-12-28 17:30:25 +01:00 |
|
millmanorama
|
9b85284194
|
remove unused outerclasses that have copies as innerclasses
|
2016-12-28 17:30:23 +01:00 |
|
millmanorama
|
c42f687bfb
|
more cleanup
more cleanup
|
2016-12-28 17:30:15 +01:00 |
|
esaunders
|
bdfe6e2c14
|
More comment clarification for CCN_REGEX
|
2016-12-28 10:24:44 -05:00 |
|
esaunders
|
3c585b1321
|
Fixed comment for CCN_REGEX
|
2016-12-28 10:08:26 -05:00 |
|
millmanorama
|
b904c37dd2
|
remove more unneeded ContentStreams and cleanup logging
|
2016-12-28 15:03:45 +01:00 |
|
millmanorama
|
0303c96d41
|
cleanup Ingester.indexChunk
|
2016-12-28 15:03:04 +01:00 |
|
millmanorama
|
abf21f58ee
|
remove obsolete and unused ContentStreams
|
2016-12-28 15:03:03 +01:00 |
|
millmanorama
|
2b4bb33798
|
cleanup up ArtifactExtractor; reduce use of ContentStream
|
2016-12-28 15:03:01 +01:00 |
|
millmanorama
|
697a7d7a58
|
reduce method overloads for indexing artifacts
|
2016-12-28 15:02:59 +01:00 |
|
millmanorama
|
b38171dbd7
|
make the ByteXXXStream classes inner classes of the TextExtractors that use them.
|
2016-12-28 15:02:58 +01:00 |
|
millmanorama
|
85af7c57b6
|
build out ArtifactExtractor
|
2016-12-28 15:02:56 +01:00 |
|
millmanorama
|
1a70a4e8b2
|
introduce ArtifactExtractor
|
2016-12-28 15:02:39 +01:00 |
|
millmanorama
|
359dc16ee5
|
inline indexChunk
|
2016-12-28 15:02:23 +01:00 |
|
millmanorama
|
c9795cabcb
|
pull up methods from TextExtractorBase into TextExtractor.java
|
2016-12-28 15:02:21 +01:00 |
|
millmanorama
|
0f1f8b2211
|
refactor common chunking algorithm into TextExtractorBase, remove AbstractFileChunk
|
2016-12-28 15:02:18 +01:00 |
|
esaunders
|
259a4ec1c9
|
Restructured HighlightedText.attemptManualHighlighting()
|
2016-12-27 17:13:08 -05:00 |
|
esaunders
|
8d82672f2f
|
Merge branch 'develop' of github.com:sleuthkit/autopsy into 2121_regex_query
|
2016-12-27 17:10:39 -05:00 |
|
esaunders
|
0e925e6823
|
Modified creation of regex keyword hits to break on a whitespace or punctuation boundary to support consistent highlighting. Also added HighlightedText.attemptManualHighlighting() for those situations where the Lucene highlighter doesn't give us useful results.
|
2016-12-27 17:00:00 -05:00 |
|
esaunders
|
4b80395b9d
|
Replaced credit card regular expression with one that does not attempt to limit the first digit to 3-6. The old regular expression resulted in an error from Solr stating: Determinizing .*[3-6]([ -]?[0-9]){11,18}.* would result in more than 10000 states.
|
2016-12-27 16:46:41 -05:00 |
|
Richard Cordovano
|
a5902d50f5
|
Correctly handle CancellationException in KeywordSearchResultFactory.BlackboardResultWriter
|
2016-12-19 17:27:42 -05:00 |
|
millmanorama
|
094db06075
|
fix compiler warnings about raw types
|
2016-12-16 14:56:41 +01:00 |
|
esaunders
|
0fce991ca0
|
Removed unnecessary Solr artifacts from build scripts.
|
2016-12-14 17:11:20 -05:00 |
|
esaunders
|
64990065f2
|
Merge branch 'solr6_standalone' into 2121_regex_query
|
2016-12-14 16:49:20 -05:00 |
|
esaunders
|
bcda17746e
|
Updated version number and commented out copying of content and file_name into content_ws.
|
2016-12-14 15:58:32 -05:00 |
|
esaunders
|
63829ba3bc
|
Updated search runner to use RegexQuery for regular expressions.
|
2016-12-14 15:56:41 -05:00 |
|
esaunders
|
a991bf7d8e
|
Modified regular expressions for use with new RegexQuery class.
|
2016-12-14 15:54:04 -05:00 |
|
esaunders
|
020011bff1
|
Change the ordering of the regex for the last element of the IP address regex because we were only getting IP address hits containing a single digit as the last element, e.g. we would get a hit for 152.163.199.5 instead of 152.163.199.56.
|
2016-12-14 13:58:18 -05:00 |
|
esaunders
|
c4561579f9
|
Perform Java regex validation for now even though Lucene regex syntax is a subset of Java.
|
2016-12-14 13:51:46 -05:00 |
|
esaunders
|
89f7af8943
|
Merge branch 'develop' of github.com:sleuthkit/autopsy into 2121_regex_query
|
2016-12-14 12:51:16 -05:00 |
|
Eugene Livis
|
d1616cdeb6
|
Fixed a very misleading error mesage
|
2016-12-14 09:56:25 -05:00 |
|
Richard Cordovano
|
bb1975b9c4
|
Merge pull request #2428 from zhhl/2123-sortSolrResultToKeepConsistantKeywordPreview
2123: Sort the Solr results to keep KeywordSearch Preview pick up the…
|
2016-12-14 09:51:08 -05:00 |
|
U-BASIS\zhaohui
|
2711788582
|
2123: correction
|
2016-12-13 17:42:02 -05:00 |
|
U-BASIS\zhaohui
|
05a6fa8d37
|
2123: clean up
|
2016-12-13 17:38:22 -05:00 |
|
U-BASIS\zhaohui
|
8a1f272738
|
2123: let Solr do ascending sorting to let us have a consistant result
|
2016-12-13 17:33:41 -05:00 |
|
esaunders
|
0517f5065e
|
Merge branch 'develop' of github.com:sleuthkit/autopsy into search_improvements
|
2016-12-13 12:59:44 -05:00 |
|
esaunders
|
32711594b5
|
Initial cleanup.
|
2016-12-13 12:56:52 -05:00 |
|
U-BASIS\zhaohui
|
4a0202cea9
|
2123: Sort the Solr results to keep KeywordSearch Preview pick up the same result each time
|
2016-12-11 09:56:57 -05:00 |
|
Ann Priestman
|
231e87187d
|
Add dialog to allow the user to add multiple keywords at a time.
|
2016-12-08 09:58:31 -05:00 |
|
esaunders
|
a782e52f80
|
Removed filterOneHitPerDocument() since (a) it's use prevents the display of hits across multiple pages/chunks and (b) QueryResults.writeAllHitsToBlackBoard() takes care of ensuring that only a single blackboard artifact is created per document.
|
2016-12-07 16:17:24 -05:00 |
|
esaunders
|
83f8d575e9
|
Add quotes around the keyword when the search results are not available to make highlighting work correctly.
|
2016-12-07 16:14:00 -05:00 |
|
esaunders
|
c7c6e5abd4
|
Commented out call to filterOneHitPerDocument() and added a lengthy comment as to why I think its unnecessary.
|
2016-12-07 15:04:47 -05:00 |
|
esaunders
|
09c5b3ccde
|
Wrap keyword in quotes to make highlighting work when keyword hits are accessed through the tree.
|
2016-12-06 11:17:12 -05:00 |
|
esaunders
|
7bcb31495e
|
Make pattern matching case insensitive.
|
2016-12-06 10:20:14 -05:00 |
|
esaunders
|
80ae7e8250
|
Merge branch 'develop' of github.com:sleuthkit/autopsy into 1727-regexes-with-whitespace
|
2016-12-05 13:44:09 -05:00 |
|
esaunders
|
218a55c69c
|
Surround search term with /.* .*/ and eliminate filtering down to one document per hit.
|
2016-12-05 13:42:24 -05:00 |
|
esaunders
|
f96f831bb6
|
Make text chunks 32k or smaller.
|
2016-12-05 13:40:53 -05:00 |
|
Richard Cordovano
|
57e506b8ae
|
Delete KWS issue report file, move into JIRA
|
2016-12-02 12:30:06 -05:00 |
|