8111 Commits

Author SHA1 Message Date
Michael Peter Christen
96cb75f1d4 made the filetype navigator be able to deselect the search constraint 2012-01-25 02:50:06 +01:00
Michael Peter Christen
9ebcae2fbc enhanced url parser to understand urls with &amp; instead of & in post
urls
2012-01-25 02:42:24 +01:00
Michael Peter Christen
30891d026f added a remove-navigation for vocabularies 2012-01-25 00:22:51 +01:00
Michael Peter Christen
1f4f60654a Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
Conflicts:
	source/net/yacy/document/parser/pdfParser.java
2012-01-24 20:42:30 +01:00
Michael Peter Christen
d5ead5314d changed navigation links: now using checkboxes.
This looks better and allows negative checkboxes (ones that remove
the navigation). These are not yet implemented (coming
next)
2012-01-24 19:03:47 +01:00
reger
32104360ce PDFParser - return at least first 3 pages of PDF
fix for pdf parsing without returning parsed text due to interruption by
time out.
2012-01-23 20:58:36 +01:00
Michael Peter Christen
696ee5fc16 removed pdf from default parser deny list 2012-01-23 17:27:58 +01:00
Michael Peter Christen
ef5192f8c9 using the generic document parser for crawl starts instead of the html
parser. This makes it possible that every type of document can be a
crawl start point, not only text documents or html documents. Tested
this with a pdf document.
2012-01-23 17:27:29 +01:00
Michael Peter Christen
33a71a61fa Merge commit 'b60e2e952102c3eae40ab98c892a8c7d1b478345' 2012-01-23 11:20:32 +01:00
reger
b60e2e9521 PDFParser - return at least first 3 pages of PDF
fix for pdf parsing without returning parsed text due to interruption by time out.
2012-01-23 04:33:07 +01:00
Michael Peter Christen
a02fdf8625 better error messages 2012-01-23 00:47:25 +01:00
Michael Peter Christen
eadb58dd87 small enhancements in pdf parser 2012-01-23 00:46:02 +01:00
Michael Peter Christen
c6ba44468e timeout = 5000 instead of 3000 2012-01-23 00:45:32 +01:00
Michael Peter Christen
44491ec6dd Merge commit 'b616de59735d33b922b7d2fbccdcc9031b77fa6e' 2012-01-22 19:08:27 +01:00
Lotus
d2ca33ccd7 Java update 2012-01-21 18:33:21 +01:00
Lotus
59ebee5de2 highlight changed row on api table 2012-01-21 18:29:53 +01:00
reger
b616de5973 PDFParser - return at least first 3 pages of PDF
fix for pdf parsing without returning parsed text due to interruption by time out.
2012-01-21 03:15:12 +01:00
Michael Peter Christen
ce620be783 fix for crawl start with smb url 2012-01-19 23:07:15 +01:00
Michael Peter Christen
e6d26a023f fix for bookmark crash with possible side-effects on crawl start after
the crash
2012-01-19 23:06:09 +01:00
Lotus
18227790ca variable search works at least on Firefox again 2012-01-19 16:54:09 +01:00
Lotus
b7f6383ab8 win installer check at least for 4gb space. resource observer pauses
crawling at 3gb free space by default and informs user what to do. so
1gb can be used by yacy at full feature. 30gb warning comes quite often
on installation time. users are confused about it and may never reach
it.
2012-01-19 16:52:26 +01:00
Lotus
c73af39e54 refactoring of tray icon class,
now uses Java 6 methods natively
2012-01-18 20:47:09 +01:00
Michael Peter Christen
5afec4af39 fix for http://bugs.yacy.net/view.php?id=131 2012-01-18 11:58:51 +01:00
Michael Peter Christen
a30b028cc0 updated libraries 2012-01-18 01:21:41 +01:00
Michael Peter Christen
dc165275ad bugfix for usage of multiple vocabulary navigators 2012-01-18 01:00:30 +01:00
Michael Peter Christen
4eff0e26f1 npe bugfix 2012-01-17 23:39:57 +01:00
low012
8776b84c10 *) small fix to make password change function of reconfigureYACY.sh work
again
2012-01-17 20:43:19 +01:00
Michael Peter Christen
190b77c55e added Ukrainian translation 2012-01-17 17:45:28 +01:00
Michael Peter Christen
1a0b6b3913 get more navigation details to search results 2012-01-17 16:44:30 +01:00
Michael Peter Christen
7f9b6b7a0c added switches to ConfigParser to accept/deny documents by their
extension
2012-01-17 16:43:34 +01:00
Michael Peter Christen
4901cee3cc suppress auto-tagged subject entries when sending out or receiving
metadata from other peers
2012-01-17 02:10:05 +01:00
Michael Peter Christen
83009d86f7 added the vocabulary navigator. It can be very simply tested by
switching on the locale dictionaries.
2012-01-17 01:53:08 +01:00
Lotus
37d43e5589 jump back to where we want to modify the scheduler in api steering after
submit
2012-01-16 20:08:24 +01:00
sixcooler
985b78cf89 correct 'available()' to use max of young / eden 2012-01-16 16:59:58 +01:00
Alexander Fieger
3db2b5f8e8 created a working free-form project for netbeans (7.1) 2012-01-16 09:05:10 +01:00
Alexander Fieger
a2a7359a7a removed non-working netbeans project. This must be a free-form project and not a java project. 2012-01-16 09:01:14 +01:00
sixcooler
4da8746275 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 2012-01-16 01:48:36 +01:00
sixcooler
c9aaa9e00a respect non-reserved Memory in GenerationMemoryStrategy
and enable it again
2012-01-16 01:46:12 +01:00
Michael Peter Christen
37f2d1b3e9 replaced Thread initialization with ExecutorService pool for delete
method. This is much faster and produces less blocking when using the
Compressor class which is used by the HTCache. I.e. picture search is
much faster now.
2012-01-16 01:05:30 +01:00
Michael Peter Christen
a58dc4a91f added autotagging to document condenser:
- tags that are automatically generated now enrich the dc:subject
- auto-generated tags have a '$' at the beginning of the tag
- auto-generated tags prefix the tag name with a vocabulary name
each tag has the form
$<vocabulary-name>:<tag-printname-space-replaced-by-'_'>
2012-01-15 22:17:57 +01:00
Michael Peter Christen
0d6176804b emergency disabling of GenerationMemoryStrategy because of non-working
available-method
2012-01-15 21:58:18 +01:00
Lotus
411aab02e3 Windows installer now detects reliably whether YaCy runs. A file lock on
the yacy.running file has been implemented.
2012-01-15 19:01:05 +01:00
Michael Peter Christen
87f0210480 enriched log output to find NPE in HeapReader 2012-01-15 12:08:46 +01:00
Michael Peter Christen
26e227f17d Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 2012-01-13 11:28:48 +01:00
Michael Peter Christen
7053f8ab46 added automatic generation of a solr schema.xml file 2012-01-13 11:26:22 +01:00
Michael Peter Christen
987b412491 updated solr scheme: generic declaration of solr schemes 2012-01-13 11:25:15 +01:00
Michael Peter Christen
254adea51c small fixes 2012-01-13 11:24:08 +01:00
apfelmaennchen
c7f88f3fd1 fix for http://bugs.yacy.net/view.php?id=101 - the default crawl
depth for bookmarks is now editable.
2012-01-12 23:30:23 +01:00
Michael Peter Christen
49be60a7c8 WorkflowProcess is forced to make small pauses if shortMemoryStatus is
reached.
2012-01-10 03:03:12 +01:00
Michael Peter Christen
b7bb84c0bb set a limit to CharBuffer object size to fight against bad/too large
content
2012-01-10 03:02:17 +01:00
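
The auto-generated tag form described in commit a58dc4a91f above, $<vocabulary-name>:<tag-printname-space-replaced-by-'_'>, can be illustrated with a minimal sketch. The class and method names below (AutoTag, format, isAutoGenerated) are hypothetical and are not taken from the YaCy source; only the tag layout comes from the commit message.

```java
/**
 * Hypothetical helper illustrating the auto-tag layout from commit a58dc4a91f.
 * This is not YaCy's actual implementation; names are assumptions.
 */
final class AutoTag {
    private AutoTag() {}

    /**
     * Builds a tag of the form $<vocabulary-name>:<print name>,
     * with every space in the print name replaced by '_'.
     */
    static String format(String vocabularyName, String printName) {
        return "$" + vocabularyName + ":" + printName.replace(' ', '_');
    }

    /** Returns true if the tag starts with the '$' marker used for auto-generated tags. */
    static boolean isAutoGenerated(String tag) {
        return tag != null && tag.startsWith("$");
    }
}
```

Under these assumptions, a vocabulary named "geo" and a print name "New York" would give the tag $geo:New_York, and a consumer could use the leading '$' to separate auto-generated entries from manually assigned dc:subject values.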