Marek Otahal
f40efb39af
Blacklist loadList() remove duplicates by using Set
Signed-off-by: Marek Otahal <markotahal@gmail.com>
2012-01-09 01:18:01 +01:00
Marek Otahal
f75b5e40e0
little fix in copy()
Signed-off-by: Marek Otahal <markotahal@gmail.com>
2012-01-09 01:16:46 +01:00
Marek Otahal
1dc5d9f0f3
make ConnectionInfo comparable and sort list of connections in Connections_p
ConnectionInfo compares by initTime
Connections_p implements the requested sorting of connections, in descending order
Signed-off-by: Marek Otahal <markotahal@gmail.com>
2012-01-09 01:14:41 +01:00
Michael Peter Christen
91940fdf56
redesign of WordCache to be prepared to hold multiple
independent dictionaries. Such dictionaries can then be also used as
simplified vocabularies.
2012-01-08 00:47:32 +01:00
Michael Christen
bd40a10230
added autotagging stub; only reading and parsing of vocabularies at
this time
2012-01-07 17:34:38 +01:00
Michael Peter Christen
2ee8cbeb2c
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
Conflicts:
source/net/yacy/search/Switchboard.java
2012-01-05 18:37:46 +01:00
Michael Peter Christen
992dbdf4bb
added noload statistic to servlets
2012-01-05 18:33:05 +01:00
Michael Christen
eebc02f5c1
fix
2012-01-04 20:24:48 +01:00
Michael Christen
216a287a85
Merge commit '6d4e08ed06c5cd28c45981b2ebe31c7f7ec6fd83' into quix0r
Conflicts:
source/de/anomic/crawler/CrawlQueues.java
2012-01-04 20:16:37 +01:00
stbrumm
d18095dc48
Patch for Issue 0000102
and fixes to Patch (private peer status is a property of a peer, not a
status)
2012-01-03 17:49:37 +01:00
stbrumm
9f1b1b4604
Type for Robinson-Mode/Private Peer added
2012-01-03 14:43:17 +01:00
Michael Christen
20962a4ed7
added metadata node stub for metadata from blobs
2012-01-03 14:38:03 +01:00
Michael Christen
575dbbaa93
enhancements in Blob retrieval: try to use less CPU resources by first testing
a blob that most certainly has the wanted entries.
2012-01-02 02:14:05 +01:00
Michael Christen
585a8f3c44
fixed a bug in search sequence (caused empty results)
2012-01-02 02:10:39 +01:00
Michael Christen
361146dd7a
better error handling for file loader
2011-12-29 14:37:19 +01:00
Roland 'Quix0r' Haeder
6d4e08ed06
Rewrote filesize() to (hopefully) avoid a NPE, rewrote Blacklist class to concurrent classes to avoid a CME
2011-12-29 03:42:38 +01:00
Roland 'Quix0r' Haeder
fa08ed5ae5
Fixed a lot of CHMOD rights (no need for execute flag on *.java/*.html) and introduced local/remote crawl size ratio based check
2011-12-29 00:33:16 +01:00
Roland Haeder
319fd1f4aa
A concurrent access can happen on the blacklist (with the recently introduced blacklist check in media snippet computation)
2011-12-28 21:40:44 +01:00
Roland 'Quix0r' Haeder
a3083d13bf
Blacklist checks are now always turned on; in media searches (e.g. image search), images matching blacklist entries are no longer shown to the user
2011-12-28 20:09:17 +01:00
Michael Christen
52184a1170
fix for search process
2011-12-27 23:43:44 +01:00
Michael Christen
85bd4cc8bc
better lookup for peer names
2011-12-25 10:14:15 +01:00
Michael Christen
20e3084bd4
redesign of finding of peers by IP: more lightweight method to read the
seed databases
2011-12-21 01:14:43 +01:00
Michael Christen
0797b0de99
new handling of remote search processes: looking for seeds will now not
block the whole search process any more. A deadlock with a DHT selection
process may have been the cause for interface lockings in the past.
2011-12-21 00:32:03 +01:00
Michael Christen
ee9aae5cc0
more about CreativeCommons license vocabulary
2011-12-18 16:07:51 +01:00
Michael Christen
ecd74fe34f
less dramatic upnp failures
2011-12-18 09:54:08 +01:00
Michael Christen
c75e1a3125
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
2011-12-18 08:20:10 +01:00
Michael Christen
13f5b5f80d
the component part in the YaCy Metadata is filled using the Dublin Core
vocabulary
2011-12-18 08:19:48 +01:00
Michael Peter Christen
8d2cbfb685
more vocabularies and more semantics for lod data structures
2011-12-18 08:12:34 +01:00
Michael Christen
9cd36b4c44
added vocabulary for geolocalization as used in georss
2011-12-17 02:03:45 +01:00
Michael Christen
9e5894c784
Removed handling of components objects for URIMetadataRows.
This is a preparation to replace these rows with nodes from the node
store.
2011-12-17 01:27:08 +01:00
Michael Christen
66ab51f89d
added rdf vocabulary
2011-12-17 01:09:16 +01:00
Michael Christen
c04bfaa51b
refactoring
2011-12-16 23:59:29 +01:00
Michael Peter Christen
136b514f52
added a Triple Store based on Nodes that fit to the new storage classes.
Added also a first Vocabulary for the node store - Dublin Core.
2011-12-16 23:01:47 +01:00
Michael Peter Christen
613ab6a69d
added BEncodedHeapBag and BEncodedHeapShard which are storage containers
for a new metadata store. An abstraction of the content for this storage
is defined with MapStore. A MapStore is an abstraction of an RDF Node
store.
2011-12-16 23:00:50 +01:00
Michael Christen
6fecd0db88
one more performance hack to prevent costly md5 computation
2011-12-15 23:33:41 +01:00
Michael Christen
e13441b069
better digest pool size (smaller by default but unlimited)
2011-12-15 17:45:46 +01:00
Michael Christen
1f4afb4dc0
performance hacks
2011-12-15 15:15:53 +01:00
Michael Christen
675d557e88
removed debug logging
2011-12-14 22:21:19 +01:00
Michael Christen
e9dc99fe15
added rules to set specific RWIs as private RWIs which are not
transmitted to remote peers. This will be used for private index copies
and phonetic indexes.
2011-12-14 22:15:51 +01:00
Michael Peter Christen
4243ace863
added phonetic classes
2011-12-14 17:33:18 +01:00
Michael Peter Christen
0bcef2d156
added feature as requested in
http://forum.yacy-websuche.de/viewtopic.php?f=18&t=3461
The search can now be configured with a non-display host list.
The search will always exclude the given list of hosts unless they are
requested directly using the host navigation.
2011-12-13 00:16:05 +01:00
Michael Christen
204c29f010
small bugfixes for search result display and cache display
2011-12-10 01:35:38 +01:00
Michael Christen
17f962fceb
translator updates:
- config string for Chinese
- do not copy the language files to DATA/LOCALE any more (and do not use
them there; this is really confusing for new translators)
2011-12-08 10:25:26 +01:00
Michael Christen
078fcde0dd
bad initialization
2011-12-07 01:02:23 +01:00
Michael Christen
14e45e90fd
patch for a bug that I don't understand yet.
2011-12-07 00:52:04 +01:00
Michael Christen
3eccdca63c
protection against too long running snippet fetch processes
2011-12-07 00:32:42 +01:00
Michael Christen
86b3385847
fixed a deadlock during secondary remote search
2011-12-07 00:18:34 +01:00
Michael Christen
c715d19c09
fixes for dependency on svn
2011-12-06 22:05:22 +01:00
Michael Christen
404758698a
less io operations
2011-12-06 22:04:34 +01:00
Michael Christen
0bc5d76bee
oops
2011-12-06 21:55:50 +01:00