The Berkeley Document Summarizer is a learning-based, single-document summarization system that extracts source document content, exploits syntactic information to compress it, and uses coreference constraints to ensure clarity.
I am currently using the Entity Preprocessing Driver main method to turn my regular .txt files into the (Conll?) format understood by this summarizer however I am getting issues at the moment with the ConllReader class used in the Summarizer class unable to parse some of the generated lines (in the assembleConstTree method because some lines appear to be missing a "*")
Would you be able to shed more light on the Conll format that the summarizer is expecting?
Kindly share a demo link of your project. I would like to test the summarization. Please let me know where I can get to see your project demo.
Thank you in advance.
I am trying to use your summarizer and refer your paper in my paper, but I got an exception as the following:
I am using mac os. I am trying to set java jni but I always got an error. There will be /usr/local/lib/jni in mac os wrote in the readme, but I can't find any folder with jni in my mac. Could you please tell me how to set jni with mac? I appreciate your help. Thank you.
The joint model (COREF+NER+WIKI) of the Berkeley Entity Resolution System combines the output for all input documents (e.g. government.txt and music.txt) into a single file output.conll.
While the output produced by other models does not exactly match the test files in the Berkeley Document Summarizer (e.g. the last two columns of government.txt are off).
Would appreciate a clarification on the assumed data interface between the Berkeley Entity Resolution System and the Berkeley Document Summarizer.