adam-m
81e22f1c2b
Store content ids not entire file object to keep track of previous results (less memory required)
2012-07-23 17:46:23 -04:00
adam-m
b2b723751d
Tika - use no timeout for now for parse() method
2012-07-23 17:29:45 -04:00
adam-m
1fd1570cb6
Better naming of module events, updated API docs
2012-07-22 19:30:51 -04:00
adam-m
c12bb2a75b
minor string buffer optimization
2012-07-19 17:40:01 -04:00
adam-m
ba518de7c8
Add local Tika extract timeout mechanism, similar to that used for Solr indexing
2012-07-18 15:06:53 -04:00
adam-m
1fad291255
TSK-546 Extracted content Arabic files issue
...
fixes issue where content was escaped twice in some cases, if set node is called multiple times on the same content (that should be looked at too)
2012-07-18 13:04:30 -04:00
adam-m
3251f1d65b
tika extract jpg not only jpeg ext.
2012-07-17 13:40:02 -04:00
adam-m
3dc0fc7b52
show meta info in last chunk only.
...
Handle unexpected unchecked exception separately.
2012-07-16 21:41:57 -04:00
adam-m
220946e240
- append and index meta-data to Tika extracted content
...
- attempt not to break words when creating chunks from Tika extracted text
2012-07-12 17:09:52 -04:00
adam-m
0c6a6a9776
If TIKA fails, do string extraction.
...
Code cleanup.
2012-07-11 13:38:17 -04:00
adam-m
9beced7ba4
Handle case when reader returns less than asked
...
Fix tika parsers dependencies for some files like MS Office
2012-07-10 17:43:59 -04:00
adam-m
fc4ecf0402
Better index timeout est based on actual byte size to ingest
2012-07-10 14:13:28 -04:00
adam-m
8f26cda926
TSK-519 Add support for files of known filetypes > 100 MB
...
(first take)
- also fix thunderbird module deps so they work with keyword search module
2012-07-10 14:05:35 -04:00
adam-m
66095ab336
fix max size of field
2012-06-28 13:52:24 -04:00
adam-m
d1fd8e7e63
Add toString() method for better logging
2012-06-28 13:49:07 -04:00
adam-m
7b6e6a4e19
fix sizing
2012-06-28 13:46:00 -04:00
adam-m
ed9dceb502
Always index meta data of known files (skip content), and 0 byte files
2012-06-28 13:34:08 -04:00
adam-m
8ba8775931
Enable label re-sizing
2012-06-28 13:25:51 -04:00
adam-m
ef8371b544
Update ingest manager proxy java docs
...
Add a method from manager to the facade
2012-06-28 10:07:48 -04:00
adam-m
f8dfacc63a
Merge branch 'master' of https://github.com/sleuthkit/autopsy
2012-06-26 14:07:41 -04:00
adam-m
bd252890c2
Remove more doxygen warnings
2012-06-26 14:07:26 -04:00
Dick Fickling
0f4d01e238
GUI tweaks for Hash and Keyword configuration
2012-06-26 13:34:39 -04:00
adam-m
71b5006906
Merge branch 'master' of https://github.com/sleuthkit/autopsy
2012-06-22 16:27:27 -04:00
adam-m
9f30cf333b
Use default field
2012-06-22 16:27:12 -04:00
Dick Fickling
ede0326ab7
Make text consistent between hash and keyword search configuration
2012-06-22 15:03:22 -04:00
Dick Fickling
4e4f21e0a7
Fix TSK-486: Rename ".." to be "[parent folder]" in tree
2012-06-20 16:57:25 -04:00
Dick Fickling
fa7a292640
Ordering/prioritizing Data Content Viewers
2012-06-20 13:12:37 -04:00
adam-m
12d757542f
Extracted text viewer - cache last text content for much quicker loading when user browses artifacts for the same content
2012-06-20 12:35:45 -04:00
adam-m
17030ab360
Provide a separate method for reporting number of all solr documents, and number of files (not chunks only).
...
Do not report number of chunks to the user, only number of files/directories.
2012-06-19 11:20:18 -04:00
adam-m
d445cf9af8
Keyword search: add general tab to configuration
2012-06-19 11:08:04 -04:00
adam-m
6afd841c9d
Moved updatekeywords() to search thread to eliminate need of synchronizing ingest and search threads
2012-06-13 16:57:07 -04:00
Dick Fickling
8b36b631f7
Tighter & faster serialization for keyword search lists
2012-06-13 11:24:10 -04:00
adam-m
097e03bc60
Extracted text viewer: deactivate for directories (they have no text content)
2012-06-13 10:59:42 -04:00
Dick Fickling
12f260222b
Keyword search edit list panel show only unlocked lists
2012-06-13 10:02:54 -04:00
adam-m
0e268cb166
TSK-504 Hide "locked" lists from list management panel
2012-06-12 14:56:52 -04:00
adam-m
08edc0737e
Lower Solr doc cache settings to 16 docs
2012-06-11 11:49:19 -04:00
adam-m
c2909727a5
Revert to allow multiple bb writers threads in user driven GUI keyword search
2012-06-11 10:27:30 -04:00
adam-m
b9e23ab9ec
-Minimize solr query mem usage:
...
Change keyword search non-ingest query to enqueue snippet query threads, not to execute multiple snippet query threads in parallel
-Add snippet for regex query to GUI
2012-06-08 18:19:20 -04:00
adam-m
1fe82e3401
change variable name
2012-06-08 18:15:00 -04:00
adam-m
6c3472918e
make column names more consistent with the blackboard result viewer
2012-06-08 15:31:56 -04:00
Dick Fickling
21aeab110e
Merge branch 'master' of github.com:sleuthkit/autopsy
2012-06-08 12:20:13 -04:00
Dick Fickling
a594e7c954
Disable searching for phone numbers and ip addresses by default
2012-06-08 12:19:25 -04:00
adam-m
7dfb6905dd
Restore showing file/dir info in search results (for alloc files/dirs)
2012-06-08 11:21:59 -04:00
adam-m
46b48ac121
update comment
2012-06-08 11:21:30 -04:00
adam-m
0f484c3d77
Fix check for background tasks running/pending
2012-06-08 11:21:10 -04:00
adam-m
008d11c494
Merge branch 'master' of https://github.com/sleuthkit/autopsy
2012-06-08 09:58:41 -04:00
adam-m
1fc68bc5e3
keyword search ingest fix concurrency issue on keyword list structure when final searcher starts, update javadoc
2012-06-08 09:58:25 -04:00
Dick Fickling
3b10a8e9e8
Support importing Encase lists, preserve hierarchy
2012-06-08 09:53:19 -04:00
Dick Fickling
98f2b540ce
Add delete key handler to Lists Management panel
2012-06-08 09:52:57 -04:00
Dick Fickling
0232dac2ea
Remove callback from keyword panel to ingest service.
...
Service proactively gets new keywords, so updating manually was unnecessary
2012-06-08 09:52:34 -04:00