Giter VIP home page Giter VIP logo

decalage2 / exefilter Goto Github PK

View Code? Open in Web Editor NEW
63.0 63.0 28.0 2.59 MB

ExeFilter is an open-source tool and framework to filter file formats in e-mails, web pages or files. It detects many common file formats and can remove active content (scripts, macros, etc) according to a configurable policy.

Home Page: http://www.decalage.info/exefilter

License: Other

Python 88.26% Batchfile 0.09% HTML 11.65%

exefilter's People

Contributors

decalage2 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

exefilter's Issues

Error on some html files

On some html file the program gives this error:

HTML Document : Error while cleaning (set_cdata_mode() takes exactly 2 arguments (1 given))
< Both on 1.1.3 and alpha versions

Python2 deprecated

Hello,
as Python 2 is deprecated, is Python 3 support planned ?
For example in Arch Linux Python 2 was recently removed.

Use ghostscript / pdfwrite to flatten PDF is dangerous ? ( i read a lot of ghostscript vulnerabilities in the past )

I tried to use pdfclean.rb with Ruby but it doesn't work ( maybe i have the wrong version of Ruby ).
I see that in Origami there is pdf2pdfa, is it different from the pdfclean.rb script ?

As alternative i am trying to use qpdf + pdfid -d ( to uncompress with qpdf and disarm javascript in pdfid with Python 3 )
I see often that ObjStm is more than zero, can ObjStm contain Javascrip or other code ?

I use often exefilter to clean PDF and it works very well :)
Thank you !
Andrea

ExeFilter incorrectly flagged as malware

It looks like some antivirus are now flagging ExeFilter releases (and even the master zip from the github repo) as malware. It is obviously a false positive, and I guess it's due to some test files bundled with ExeFilter.
Solution: remove test files, or zip them with a password.

Virustotal is currently OK with the master zip from github (but my corporate AV blocks it):
https://www.virustotal.com/gui/url/1bd6773a63c33e984ea75642a60986b84c4005325f805662c725be34ea844909?nocache=1
But the latest release from 2011 on Adullact triggers some AV:
https://www.virustotal.com/gui/url/86504856a9901133a900ecc706a37213d0de28e527ac7af1f1c1ae80c2a2dd9a
image

Trouble removing Word macros

Hi,

I'm attempting to use ExeFilter.py to remove Word macros on Mac and Linux. When I run ExeFilter.py against a .doc file with macros it reports that it cleaned the file but scanning the file with Clamav still reports Heuristics.OLE2.ContainsMacros FOUND.

I've tested this with both version 1.1.3 and version 1.1.4-alpha6 and get the same results. Could you advise as to whether complete macro removal is possible with ExeFilter such that Clamav would not report an error?

Thanks,

Ashlin.

Active pdf (javascript) not detected

This pdf has a simple javascript that displays a popup when opening the file.
It has been created with Acrobat on windows, no manual manipulation and no special tool.

Exefilter (pdfid) does not detect the /JavaScript and the /JS tags, therefore they are not disarmed.
These tags are embedded in an obj within the pdf.

By uncompressing the file with qpdf --stream-data=uncompress This_is_an_active_PDF_esg.pdf uncompressed.pdf we can observe the embedded object:

<< /Type /ObjStm /Length 1462 /N 8 /First 50 >>
stream                                                                                                                                              
5 0 6 127 7 151 8 182 9 280 10 535 11 919 12 1180
<< /DA (/Helv 0 Tf 0 g ) /DR << /Encoding << /PDFDocEncoding 29 0 R >> /Font << /Helv 27 0 R /ZaDb 28 0 R >> >> /Fields [ ] >>
<< /JavaScript 7 0 R >>
<< /Names [ (Hello) 8 0 R ] >>
<< /JS (app.alert\({cMsg: "JavaScript action: hello", cTitle: "demo title"}\);) /S /JavaScript >>
<< /Ascent 1026 /CapHeight 632 /Descent -312 /Flags 32 /FontBBox [ -503 -312 1240 1026 ] /FontFamily (Calibri) /FontFile2 30 0 R /FontName /OKZOSG+C
<< /BaseFont /OKZOSG+Calibri /Encoding /WinAnsiEncoding /FirstChar 32 /FontDescriptor 9 0 R /LastChar 117 /Subtype /TrueType /ToUnicode 31 0 R /Type
<< /Ascent 1039 /CapHeight 632 /Descent -349 /Flags 32 /FontBBox [ -519 -349 1263 1039 ] /FontFamily (Calibri) /FontFile2 32 0 R /FontName /UBZYYU+C
<< /BaseFont /UBZYYU+Calibri-Bold /Encoding /WinAnsiEncoding /FirstChar 97 /FontDescriptor 11 0 R /LastChar 118 /Subtype /TrueType /ToUnicode 33 0 R
endstream
endobj

Can we do something about it?

Original pdf file:

This_is_an_active_PDF_esg.pdf

Uncompressed version:

uncompress.pdf

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.