psal / jstylo-anonymouth Goto Github PK
View Code? Open in Web Editor NEWThis project forked from evllabs/jgaap
Java-based Authorship Recognition Analysis and Evasion Tools
Home Page: http://psal.cs.drexel.edu/
License: Other
This project forked from evllabs/jgaap
Java-based Authorship Recognition Analysis and Evasion Tools
Home Page: http://psal.cs.drexel.edu/
License: Other
The COPYRIGHT file currently reads:
JSAN, JStylo and Anonymouth are copyright 2011 by the Privacy, Security and Automation lab at Drexel University.
All rights reserved; all unauthorized use or modification prohibited.
Please make it mention AGPL. Currently it sounds like I may "barely look at it".
PS: This is about branch anonymouth_master.
JSANGUIChooser.java reads:
try {
UIManager.setLookAndFeel("com.sun.java.swing.plaf.windows.WindowsLookAndFeel");
} catch (Exception e) {
System.err.println("Look-and-Feel error! (Not on Windows)");
}
This is definitely missing support for the GTK+ look and feel -- Metal hurts my eyes :-)
This relates to branch anonymouth_master.
$ java [..] edu.drexel.psal.GUI.JSANGUIChooser
[..]
23-18-58: Reading CumulativeFeatureDriver from [..]/./jsan_resources/feature_sets/writeprints_limited_norm.xml
failed!
failed!
failed!
failed!
[..]
Hi there,
I believe you're not working on the project anymore, but I hope you could still help me out with the issue I have! It is sort of the same issue that dan-blanchard posted about in 2012.
The analysis using WritePrints (Limited) takes up a very long time. I am using it on a very small corpus: training data consisting of only 3 texts from 2 authors and I have 4 test documents.
I am not very familiar with programming, so I followed your basic JStylo instructions. I run the WritePrints (Limited) feature set together with the SMO classifier. However, I had it running for at least 24 hours, but still not finished. When I use the 9-feature set, I have no problems whatsoever and the anlysis is done in a couple of minutes.
Perhaps you could help me? I would really want to use the WritePrints feature set, since it provides more accurate results. The only thing I know that it is not my memory of my laptop, since it takes up only 15% of my working memory..
Any ideas on how to solve the problem or suggestions on how to find out where to find the source of the issue?
Thanks in advance!!
$ javac ... [..]/edu/drexel/psal/jstylo/GUI/AnalysisTabDriver.java
[..]/edu/drexel/psal/jstylo/GUI/AnalysisTabDriver.java:35: warning: TypeMismatchException is internal proprietary API and may be removed in a future release
import com.sun.corba.se.impl.io.TypeMismatchException;
It's not just that file, though.
PS: Affects branch anonymouth_master.
I'm trying to process a set of 10,000 files using JStylo (with 12 possible authors), and it takes an extremely long time to generate the features (using the WriteLimits set). I've had it running for over three weeks, and unfortunately I now have to restart the process because I realized there was some metadata in the files that I did not want there.
Anyway, is there any advice you can offer on how to speed things up?
According to top, it seems that only one thread was running for most of the past three weeks, so are you planning on making the feature-generation code more multithreaded in the future?
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.