1666 Commits

Author SHA1 Message Date
esaunders
acf05a45e8 Modified message that is displayed in content viewer when we are unable to find the text associated with a keyword hit. 2017-01-10 10:26:11 -05:00
esaunders
d20e70b58b Reverting previous modification of IP address regex as it does not help reduce false positives. 2017-01-10 10:24:57 -05:00
esaunders
ff85d494ed Tightened up IP address regex to not match hits that have more than 4 elements separated by dots. Removed trailing slash from URL regex as it is not needed...the trailing slash is being handled as one of the boundary characters in RegexQuery 2017-01-09 16:39:53 -05:00
esaunders
acae764760 Modified the phone number regex to pick up number that have spaces in them...perhaps this will produce more false positives but in our test data it produces over 1,000 extra numbers that are not found in Autopsy 4.2. Also updated the email regex to find email addresses surrounded in {} sometimes seen in academic publications. 2017-01-04 17:14:06 -05:00
esaunders
ba7f8ab9b3 Consolidated boundary characters into a single list. 2017-01-04 15:19:21 -05:00
esaunders
c172e0f16e Fix for missing characters in snippets and reduce length of snippets in an attempt to more closely match previous version of Autopsy. 2017-01-04 12:23:42 -05:00
esaunders
8432fec205 Updated email and url regexes to be case insensitive. 2017-01-04 12:22:17 -05:00
esaunders
6304300f62 Merge branch 'develop' of github.com:sleuthkit/autopsy into 2121_regex_query 2017-01-03 12:56:04 -05:00
esaunders
45c2b0c065 Set results max page size to 512. 2017-01-03 12:48:39 -05:00
esaunders
c1f326775a Added result paging support. 2017-01-03 12:47:16 -05:00
Richard Cordovano
46369eff44 Update NBM versioning for 4.3.0 2017-01-02 18:45:21 -05:00
Richard Cordovano
13411450aa 4.3.0 preps: DSPs, public API restore, const name 2017-01-02 17:36:59 -05:00
esaunders
681699467d Needed to tweak the CC regex and our boundary characters to successfully match CC numbers in our test data set. 2016-12-28 14:37:51 -05:00
esaunders
bdfe6e2c14 More comment clarification for CCN_REGEX 2016-12-28 10:24:44 -05:00
esaunders
3c585b1321 Fixed comment for CCN_REGEX 2016-12-28 10:08:26 -05:00
esaunders
259a4ec1c9 Restructured HighlightedText.attemptManualHighlighting() 2016-12-27 17:13:08 -05:00
esaunders
8d82672f2f Merge branch 'develop' of github.com:sleuthkit/autopsy into 2121_regex_query 2016-12-27 17:10:39 -05:00
esaunders
0e925e6823 Modified creation of regex keyword hits to break on a whitespace or punctuation boundary to support consistent highlighting. Also added HighlightedText.attemptManualHighlighting() for those situations where the Lucene highlighter doesn't give us useful results. 2016-12-27 17:00:00 -05:00
esaunders
4b80395b9d Replaced credit card regular expression with one that does not attempt to limit the first digit to 3-6. The old regular expression resulted in an error from Solr stating: Determinizing .*[3-6]([ -]?[0-9]){11,18}.* would result in more than 10000 states. 2016-12-27 16:46:41 -05:00
Richard Cordovano
a5902d50f5 Correctly handle CancellationException in KeywordSearchResultFactory.BlackboardResultWriter 2016-12-19 17:27:42 -05:00
esaunders
0fce991ca0 Removed unnecessary Solr artifacts from build scripts. 2016-12-14 17:11:20 -05:00
esaunders
64990065f2 Merge branch 'solr6_standalone' into 2121_regex_query 2016-12-14 16:49:20 -05:00
esaunders
bcda17746e Updated version number and commented out copying of content and file_name into content_ws. 2016-12-14 15:58:32 -05:00
esaunders
63829ba3bc Updated search runner to use RegexQuery for regular expressions. 2016-12-14 15:56:41 -05:00
esaunders
a991bf7d8e Modified regular expressions for use with new RegexQuery class. 2016-12-14 15:54:04 -05:00
esaunders
020011bff1 Change the ordering of the regex for the last element of the IP address regex because we were only getting IP address hits containing a single digit as the last element, e.g. we would get a hit for 152.163.199.5 instead of 152.163.199.56. 2016-12-14 13:58:18 -05:00
esaunders
c4561579f9 Perform Java regex validation for now even though Lucene regex syntax is a subset of Java. 2016-12-14 13:51:46 -05:00
esaunders
89f7af8943 Merge branch 'develop' of github.com:sleuthkit/autopsy into 2121_regex_query 2016-12-14 12:51:16 -05:00
Eugene Livis
d1616cdeb6 Fixed a very misleading error mesage 2016-12-14 09:56:25 -05:00
Richard Cordovano
bb1975b9c4 Merge pull request #2428 from zhhl/2123-sortSolrResultToKeepConsistantKeywordPreview
2123: Sort the Solr results to keep KeywordSearch Preview pick up the…
2016-12-14 09:51:08 -05:00
U-BASIS\zhaohui
2711788582 2123: correction 2016-12-13 17:42:02 -05:00
U-BASIS\zhaohui
05a6fa8d37 2123: clean up 2016-12-13 17:38:22 -05:00
U-BASIS\zhaohui
8a1f272738 2123: let Solr do ascending sorting to let us have a consistant result 2016-12-13 17:33:41 -05:00
esaunders
0517f5065e Merge branch 'develop' of github.com:sleuthkit/autopsy into search_improvements 2016-12-13 12:59:44 -05:00
esaunders
32711594b5 Initial cleanup. 2016-12-13 12:56:52 -05:00
U-BASIS\zhaohui
4a0202cea9 2123: Sort the Solr results to keep KeywordSearch Preview pick up the same result each time 2016-12-11 09:56:57 -05:00
Ann Priestman
231e87187d Add dialog to allow the user to add multiple keywords at a time. 2016-12-08 09:58:31 -05:00
esaunders
a782e52f80 Removed filterOneHitPerDocument() since (a) it's use prevents the display of hits across multiple pages/chunks and (b) QueryResults.writeAllHitsToBlackBoard() takes care of ensuring that only a single blackboard artifact is created per document. 2016-12-07 16:17:24 -05:00
esaunders
83f8d575e9 Add quotes around the keyword when the search results are not available to make highlighting work correctly. 2016-12-07 16:14:00 -05:00
esaunders
c7c6e5abd4 Commented out call to filterOneHitPerDocument() and added a lengthy comment as to why I think its unnecessary. 2016-12-07 15:04:47 -05:00
esaunders
09c5b3ccde Wrap keyword in quotes to make highlighting work when keyword hits are accessed through the tree. 2016-12-06 11:17:12 -05:00
esaunders
7bcb31495e Make pattern matching case insensitive. 2016-12-06 10:20:14 -05:00
esaunders
80ae7e8250 Merge branch 'develop' of github.com:sleuthkit/autopsy into 1727-regexes-with-whitespace 2016-12-05 13:44:09 -05:00
esaunders
218a55c69c Surround search term with /.* .*/ and eliminate filtering down to one document per hit. 2016-12-05 13:42:24 -05:00
esaunders
f96f831bb6 Make text chunks 32k or smaller. 2016-12-05 13:40:53 -05:00
Richard Cordovano
57e506b8ae Delete KWS issue report file, move into JIRA 2016-12-02 12:30:06 -05:00
esaunders
950ac1aea5 Merge branch 'develop' of github.com:sleuthkit/autopsy into 1727-regexes-with-whitespace 2016-11-28 13:32:29 -05:00
Richard Cordovano
5638cf5b79 Merge pull request #2405 from zhhl/2050-NPEinRegexSearch
2050 handle the KeywordSearchException while call KeywordSearchQuery:performQuery()
2016-11-22 17:44:51 -05:00
esaunders
be855a3e7f Really remove SOLR_HOME setting this time. 2016-11-21 15:11:00 -05:00
esaunders
6269859d0f Removed SOLR_HOME setting since it ends up overwriting setting in Server.java 2016-11-21 14:30:24 -05:00