wheresvic / zsearch Goto Github PK
View Code? Open in Web Editor NEWA high performance search engine
A high performance search engine
The following non-source files are included in the repository:
leveldb/build_config.mk
leveldb/libleveldb.dylib
leveldb/libleveldb.dylib.1
leveldb/libleveldb.dylib.1.16
They should be removed + ignored.
A couple of the makefiles use g++ as the compiler. I think it is better to let the user define the compiler (e.g., by using $(CXX)).
Will probably need to implement swap as we can't return the abstract Set from the current CompressedSet::removeDoc() function ...
Maybe push to second release?
Or simply use some library to escape xml entities before returning it ...
Seems to blow up with rapid xml :(
might also need to change leveldb allocator
LIBEVENT_TARGET:
$(MAKE) -C libevent/make clean
We need to collect for each blocking operation typ
Need to save on RAM!
This look very good
http://highlandsun.com/hyc/mdb/microbench/
Do we really need this?
Also how should we handle empty fields?
LazyAndSet does not seem to work with BasicSet. Could be an issue in either, will need to write some unit tests to make sure. Mostly likely bug is in BasicSet ...
We can reserve id 0 to mark the start of iteration, a docID of 0 is invalid ... they will always start with 1
Make compile time faster - you can leave the header only files as a separate copy and we can change the source to inline it for performance in a production build (if it actually makes a difference).
It seems to work but it breaks the tokenizer ...
./src/server.cpp refers to some header files in a subdirectory "event2" which are not included.
Need to add support in the Engine, test, add support in the server and the API. We can perhaps leave this for a second release - let's focus on creating a release and the website for now :)
The getString functions are fragile. I am getting errors about
one of them trying to override another.
It feels like it would be better and simpler to do:
template <class T>
static string getString(T v)
{
std::stringstream ss;
ss << v;
return ss.str();
}
(Arguably, it is maybe not the fastest way to do these conversions.)
recreate the xml files but using a proper library ... or just dump text with the title as the file name
"total+area+of+9.7%A0square+miles" does not convert back correctly
Basically in case you're trying to run two instances of the server. We could consider adding parameters for db files etc, but this might make the startup have too much overhead.
we should something portable like leveldb::port::Mutex
see InvertedIndexBatch.hpp for an example
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.