740 Commits

Author SHA1 Message Date
adam-m
12b8917346 better error msg in indexer blocked by security software 2013-01-22 12:23:46 -05:00
adam-m
34e1b811c3 change java source level to 1.7 2013-01-20 08:48:14 -05:00
adam-m
93de5d7c33 Update versions for 3.0.4 release 2013-01-18 09:56:07 -05:00
adam-m
9194325727 add complete list of MSOffice and OpenOffice extensions to Tika extractor
fixes #133
2013-01-17 13:22:27 -05:00
adam-m
00f57f6bad add MS OFFICE 2007 extensions to tika extractor to process 2013-01-17 12:55:07 -05:00
adam-m
1bca70f08c Share tika jar from keyword search to thunderbird to avoid multiple loads (TODO move Tika to CoreLib) 2013-01-17 10:04:03 -05:00
adam-m
06efec11f6 strip html from message 2013-01-16 14:15:29 -05:00
adam-m
4156200a57 use new message/notify API instead of some modal dialogs 2013-01-16 14:08:50 -05:00
Tim McIver
3a5727b9ff Fixed log message typos. 2013-01-11 12:38:01 -05:00
Tim McIver
e840fa0a0e These changes in support of AUT-740 concerning KeywordSearch 'Advanced
Keyword Search Configuration' dialog box. Renamed 'Save As' button to
'Copy List'. Added cancel button. Altered logic so that when a copy list
operation is aborted, a dialog no longer pops up saying that the list was
saved.
2013-01-11 12:35:02 -05:00
adam-m
44599504ed keyword search ingest: search only images being ingested, not other images present in the index 2013-01-09 17:31:02 -05:00
adam-m
64cec303d7 Merge branch 'master' of https://github.com/Devin148/autopsy 2013-01-08 12:37:08 -05:00
adam-m
f539a14e96 explicitely add icu4j optional pdfbox dependency we need for tika 2013-01-04 17:36:55 -05:00
adam-m
330c3ca45a update module versions and deps for release 2013-01-04 16:39:13 -05:00
adam-m
afb45cab13 lift literal query max rows 2013-01-04 12:48:24 -05:00
adam-m
11e593a39b update max terms 2013-01-04 12:46:30 -05:00
adam-m
f72dbc4c77 config change make changes visible after 15k doc commit kicks in 2013-01-04 12:37:19 -05:00
adam-m
4cd1202c73 move cleanup to later 2013-01-04 12:36:58 -05:00
adam-m
0cb089eea3 better progress updates in between keywords, lifted limit on terms query 2013-01-04 11:02:53 -05:00
adam-m
f4aaac147c keyword search ingest:
- better progress bar resolution.
- send events to update dir tree periodically
2013-01-04 09:49:50 -05:00
adam-m
78af87ec78 Merge branch 'master' of https://github.com/tmciver-basis/autopsy 2013-01-04 09:17:01 -05:00
Tim McIver
cc9d3cb5c7 Updates to keyword list saving/loading to support feature where XML
attributes 'use_for_ingest' and 'ingest_messages' are not written when the
keyword list is exported whereas when the list is saved, those attributes
are written. In support of AUT-690.
2013-01-03 21:16:06 -05:00
adam-m
88dd54c1ef better names in text view 2013-01-03 16:43:31 -05:00
Tim McIver
c03c2f40fd Merge branch 'master' of https://github.com/sleuthkit/autopsy 2013-01-03 15:17:14 -05:00
adam-m
1d4716ca4c text view: fix scrolling to first highlighted hit when text is first loaded 2013-01-03 14:06:34 -05:00
Tim McIver
6713f6481b Removed code that writes attributes 'use_for_ingest' and 'ingest_messages'
in keyword list XML file when exporting keyword lists. These changes are in
support of AUT-690.
2013-01-03 11:24:05 -05:00
adam-m
8e2502a300 Merge branch 'master' of https://github.com/sleuthkit/autopsy 2013-01-03 10:56:51 -05:00
Tim McIver
67fbd28024 Merge branch 'master' of https://github.com/sleuthkit/autopsy 2013-01-03 10:45:44 -05:00
adam-m
bc3ccd16a6 add image_id to Solr schema 2013-01-03 10:34:20 -05:00
adam-m
804991dc3b adjust max terms 2013-01-03 10:16:58 -05:00
adam-m
5eada99ca5 increase token limit, add more comments to schema 2013-01-03 09:33:35 -05:00
adam-m
e6dd4e4607 use FVH highlighter also in Text View.
Also makes highlighting better: highlighted tokens are not longer split on white-space as with original highlighter
2013-01-02 17:55:52 -05:00
Tim McIver
6285beaa5b Merge branch 'master' of https://github.com/sleuthkit/autopsy 2013-01-02 17:42:50 -05:00
Tim McIver
2081639d30 Merge branch 'general-options' 2013-01-02 17:41:24 -05:00
Tim McIver
05360ba78a Updates to code to complete implementation of behavior described in
AUT-729. In short, there is now a 'General' options tab where the user can
select whether the content viewer should remain on the one the user last
selected or if the 'most preferred' one should be used.
2013-01-02 17:34:36 -05:00
adam-m
c19993c480 keyword search:
- use FVH for snippets query for 3x snippet query performance
- index 100k tokens per 1MB chunk instead of 10k, indexing all content
- increase snippet length from 45 to 50
- add limit on regex query to be 20k results matching the regex - todo tune more
- log how many fscontent hits on regex query - will use data to tune in future
- minor code adjustments, debug queries if dev build, etc
2013-01-02 17:31:48 -05:00
adam-m
525a91d72c move module data cleanup to complete() and stop(). Cleanup all data earlier. 2012-12-31 11:50:20 -05:00
adam-m
c0f59153b8 enclose highlight text in html tags for correctness 2012-12-28 18:17:32 -05:00
adam-m
55332e0076 AUT-732 Text view exception
fix Solr escaping html when generating highlight text, set html encoder to default
2012-12-28 18:16:39 -05:00
adam-m
4041814879 remove explicit optimize - not needed, and taking too much time to open case / core 2012-12-28 16:09:56 -05:00
Devin148
d4c69e3c96 Add right click tagging and display tags in directory tree 2012-12-28 15:58:51 -05:00
adam-m
d660df2940 upgrade jericho parser from 3.2 to 3.3 2012-12-28 13:32:15 -05:00
adam-m
e2c8b91dc3 Keyword search - bypass Tika extract handler entirely, do not prepend BOM (used as workaround to bypass Tika), use /update UpdateRequestHandler.
Does not use streaming for now, converts the stream to SolrInputDocument with content String set, before it is indexed.
2012-12-27 13:51:08 -05:00
adam-m
5e0912f6bc Solr 4.0.0 config and schema updates: keep 3.6 behavior for now for better comparison
Do not  use NRT caching - possibly causing heap issues in Solr, keep 3.6 default.
2012-12-27 13:23:25 -05:00
adam-m
f003297fb1 Solr config updates 2012-12-21 17:40:04 -05:00
adam-m
648f25d8a6 fix solr4.0.0 deprecation 2012-12-21 17:31:43 -05:00
adam-m
e5bba26a07 update HL_ANALYZE_CHARS_UNLIMITED to more compliant 2012-12-21 17:26:19 -05:00
adam-m
7493eaac55 log header on tika exception also to main logger 2012-12-21 16:24:45 -05:00
adam-m
f836acc039 better log message when tika read error 2012-12-21 16:23:06 -05:00
adam-m
e50e247557 initial SOlr and Tika upgrade to 4.0.0 / 1.2: updated build, packages and module deps, API changes, initial configuration changes, update to using new handlers, bumped LUCENE version frm 36 to 40 2012-12-21 14:16:01 -05:00