473 Commits

Author SHA1 Message Date
adam-m
81e22f1c2b Store content ids not entire file object to keep track of previous results (less memory required) 2012-07-23 17:46:23 -04:00
adam-m
b2b723751d Tika - use no timeout for now for parse() method 2012-07-23 17:29:45 -04:00
adam-m
1fd1570cb6 Better naming of module events, updated API docs 2012-07-22 19:30:51 -04:00
adam-m
c12bb2a75b minor string buffer optimization 2012-07-19 17:40:01 -04:00
adam-m
ba518de7c8 Add local Tika extract timeout mechanism, similar to that used for Solr indexing 2012-07-18 15:06:53 -04:00
adam-m
1fad291255 TSK-546 Extracted content Arabic files issue
fixes issue when content was escaped twice in some cases, if set node called multiple times on the same content (that should be looked at too)
2012-07-18 13:04:30 -04:00
adam-m
3251f1d65b tika extract jpg not onyl jpeg ext. 2012-07-17 13:40:02 -04:00
adam-m
3dc0fc7b52 show meta info in last chunk only.
Handle unexpected unchecked exception separately.
2012-07-16 21:41:57 -04:00
adam-m
220946e240 - append and index meta-data to Tika extracted content
- attempt not to break words when creating chunks from Tika extracted text
2012-07-12 17:09:52 -04:00
adam-m
0c6a6a9776 If TIKA fails, do string extraction.
Code cleanup.
2012-07-11 13:38:17 -04:00
adam-m
9beced7ba4 Handle case when reader returns less than asked
Fix tika parsers dependencies for some files like MS Office
2012-07-10 17:43:59 -04:00
adam-m
fc4ecf0402 Better index timeout est based on actual byte size to ingest 2012-07-10 14:13:28 -04:00
adam-m
8f26cda926 TSK-519 Add support for files of known filetypes > 100 MB
(first take)
- also fix thunderbird module deps so they work with keyword search module
2012-07-10 14:05:35 -04:00
adam-m
66095ab336 fix max size of field 2012-06-28 13:52:24 -04:00
adam-m
d1fd8e7e63 Add toString() method for better logging 2012-06-28 13:49:07 -04:00
adam-m
7b6e6a4e19 fix sizing 2012-06-28 13:46:00 -04:00
adam-m
ed9dceb502 Always index meta data of known files (skip content), and 0 byte files 2012-06-28 13:34:08 -04:00
adam-m
8ba8775931 Enable label re-sizing 2012-06-28 13:25:51 -04:00
adam-m
ef8371b544 Update ingest manager proxy java docs
Add a method to from manager to the facade
2012-06-28 10:07:48 -04:00
adam-m
f8dfacc63a Merge branch 'master' of https://github.com/sleuthkit/autopsy 2012-06-26 14:07:41 -04:00
adam-m
bd252890c2 Remove more doxygen warnings 2012-06-26 14:07:26 -04:00
Dick Fickling
0f4d01e238 GUI tweaks for Hash and Keyword configuration 2012-06-26 13:34:39 -04:00
adam-m
71b5006906 Merge branch 'master' of https://github.com/sleuthkit/autopsy 2012-06-22 16:27:27 -04:00
adam-m
9f30cf333b Use default field 2012-06-22 16:27:12 -04:00
Dick Fickling
ede0326ab7 Make text consistent between hash and keyword search configuration 2012-06-22 15:03:22 -04:00
Dick Fickling
4e4f21e0a7 Fix TSK-486: Rename ".." to be "[parent folder]" in tree 2012-06-20 16:57:25 -04:00
Dick Fickling
fa7a292640 Ordering/prioritizing Data Content Viewers 2012-06-20 13:12:37 -04:00
adam-m
12d757542f Extracted text viewer - cache last text content for much quicker loading when user browses artifacts for the same content 2012-06-20 12:35:45 -04:00
adam-m
17030ab360 Provide a separate method for reporting number of all solr documents, and number of files (not chunks only).
Do not report number of chunks to the user, only number of files/directories.
2012-06-19 11:20:18 -04:00
adam-m
d445cf9af8 Keyword search: add general tab to configuration 2012-06-19 11:08:04 -04:00
adam-m
6afd841c9d Moved updatekeywords() to search thread to eliminate need of synchronizing ingest and search threads 2012-06-13 16:57:07 -04:00
Dick Fickling
8b36b631f7 Tighter & faster serialization for keyword search lists 2012-06-13 11:24:10 -04:00
adam-m
097e03bc60 Extracted text viewer: deactivate for directories (they have no text content) 2012-06-13 10:59:42 -04:00
Dick Fickling
12f260222b Keyword search edit list panel show only unlocked lists 2012-06-13 10:02:54 -04:00
adam-m
0e268cb166 TSK-504 Hide "locked" lists from list management panel 2012-06-12 14:56:52 -04:00
adam-m
08edc0737e Lower Solr doc cache settings to 16 docs 2012-06-11 11:49:19 -04:00
adam-m
c2909727a5 Revert to allow multiple bb writers threads in user driven GUI keyword search 2012-06-11 10:27:30 -04:00
adam-m
b9e23ab9ec -Minimize solr query mem usage:
Change keyword search non-ingest query to enqueue snippet query threads, not to execute multiple snippet query threads in parallel
-Add snippet for regex query to GUI
2012-06-08 18:19:20 -04:00
adam-m
1fe82e3401 change variable name 2012-06-08 18:15:00 -04:00
adam-m
6c3472918e make column names more consistent with the blackboard result viewer 2012-06-08 15:31:56 -04:00
Dick Fickling
21aeab110e Merge branch 'master' of github.com:sleuthkit/autopsy 2012-06-08 12:20:13 -04:00
Dick Fickling
a594e7c954 Disable searching for phone numbers and ip addresses by default 2012-06-08 12:19:25 -04:00
adam-m
7dfb6905dd Restore showing file/dir info in search results (for alloc files/dirs) 2012-06-08 11:21:59 -04:00
adam-m
46b48ac121 update comment 2012-06-08 11:21:30 -04:00
adam-m
0f484c3d77 Fix check if has background tasks running /pending 2012-06-08 11:21:10 -04:00
adam-m
008d11c494 Merge branch 'master' of https://github.com/sleuthkit/autopsy 2012-06-08 09:58:41 -04:00
adam-m
1fc68bc5e3 keyword search ingest fix concurrency issue on keyword list structure when final searcher starts, update javadoc 2012-06-08 09:58:25 -04:00
Dick Fickling
3b10a8e9e8 Support importing Encase lists, preserve hierarchy 2012-06-08 09:53:19 -04:00
Dick Fickling
98f2b540ce Add delete key handler to Lists Management panel 2012-06-08 09:52:57 -04:00
Dick Fickling
0232dac2ea Remove callback from keyword panel to ingest servece.
Service proactively gets new keywords, to updating manually was unnecessary
2012-06-08 09:52:34 -04:00