Comments (4)
hello
there is a lot of security concerns with Kuiper not only vulnerable libraries (such as not proper authentication and authorization), if security is important i recommend to not connect it to enterprise network (use it in lab network and only allow specific users to access the lab network).
for the python3, I'm working on new release with complete rebuild from scratch with proper structure and new features, of course it is python3, currently I'm testing it and checking the licensing.
from kuiper.
Hey @salehmuhaysin,
It's not directly related to this issue but I wanted to bring something to your attention that could be worthwhile to add to your new release before it's out (in case you are still using MongoDB and didn't change the collection layouts).
We noticed a critical issue with the MongoDB files collection. Since metadata about all files that belong to a machine is stored in a single document, large machines with around 30k files become highly problematic. The max document size in MongoDB is 16 MB and it cannot be increased without resorting to tricks.
So I put some work into restructuring the collection to represent each file as a single document. This has the following advantages:
- Document size limit is never hit.
- The flattened design allows all queries to be directly executed on the database without having to do processing/filtering on the Python side, greatly increasing performance in many scenarios.
- The old code has concurrency problems if multiple tasks are running for the same machine, e.g., updates can be lost. The new code does not have this problem because it expresses everything through atomic db operations.
You can find the changes here: https://github.com/GDATAAdvancedAnalytics/Kuiper/tree/mongodb-revamp - feel free to apply it to your code base or take inspiration from it. :)
I don't want to open a PR for this for Kuiper v2 because it's a breaking change for existing deployments. Theoretically it would be feasible to integrate it, but it'd require some fancy automatic migration mechanism for existing data.
from kuiper.
hello,
There are a lot of issues with the structure of Kuiper, specially the mongoDB database and flask which make adding new features of fixing issue very hard.
mongoDB is no longer in the new release since it is not efficient to do simple things like relationship and has other issues for scaling,
the new release will be commercial in cloud with more features added including full API, user management, etc. it should be released soon.
still will continue the Kuiper as open source but the new one will be different since it is restructured from scratch
from kuiper.
Related Issues (20)
- Search fails when selecting time range HOT 6
- Kuiper modifies meaning of parsed data (quoting) HOT 2
- KAPE: Error extracting the archive content: compression type 9 (deflate64) HOT 1
- ZIP files generated in Windows (e.g. 7-zip) are not processed properly HOT 1
- IIS Access Logs Parser failed because of 'utf8' codec HOT 2
- "Powershell_Execution" rule does not catch "-encodedcommand" HOT 2
- Security Vulnerability Report: Open Redirect in Login HOT 1
- After installing v2.3.5 - still v2.3.4 displayed HOT 1
- Raw file removal code commented in recent commit HOT 4
- Search broken on large shard/index HOT 1
- v2.3.3 Issue: kuiper_flask is constantly restarting and does not start on HOT 2
- Tag multiple rows HOT 2
- Defender Detection History Parser HOT 1
- Celery and Flask restarting-loop HOT 2
- future to add .tar to be upload HOT 2
- Worker exited prematurely: signal 9 (SIGKILL) Job: 3. HOT 1
- ZIP files not uploading HOT 1
- Authentication Feature HOT 4
- JumpList and Browser_History Parsing ERROR HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from kuiper.