Giter VIP home page Giter VIP logo

static-files's Introduction

Overview

A collection of static files maintained by the Sublime team, primarily used for phishing defense.

Repo contents

  • alexa_top_1m.csv: Alexa top 1 million sites.
  • disposable_email_providers.txt: Disposable (or temporary) email providers that generate short-lived email addresses not owned by or attributable to any single user.
  • file_extensions_common_archives.txt: Common archive file extensions, which can be used to deliver malicious files. Uncommon archive extensions are not included in this list because they can typically be blocked holistically.
  • file_extensions_macros.txt: File extensions of macro-supported documents, such as Microsoft Office files. Also includes file types that can load macros indirectly, eg via remote template injection
  • free_email_providers.txt: Free email ("freemail") providers that allow anyone to create an email address. This is important to ensure the email address does not inherit the reputation of the freemail provider's domain. This list also includes the contents of the disposable_email_providers.txt list.
  • free_file_hosts.txt: Sites that allow anyone to upload and serve arbitrary files or content. Content can include anything from a file preview with a clickable link to rendering HTML. This may include sites that uses arbitrary subdomains to load balance sites, but those subdomains are re-used for the site's users and are not unique to the owner of the content.
  • free_subdomain_hosts.txt: Sites that allow anyone to create their own subdomain and host arbitrary content. This is important to ensure the subdomains do not inherit the reputation of the root domain. This includes both subdomains that are auto-generated as well as user-named subdomains. Subdomains are unique to the user that created it, and not re-used by the site for other users.
  • majestic_million.csv: Majestic million is a collection of domains that have the most referring subnets.
  • suspicious_content.txt: Words or phrases that may be considered suspicious in an email body.
  • suspicious_subjects.txt: Words or phrases that may be considered suspicious in an email subject.
  • suspicious_subjects_regex.txt: Regular expressions for words or phrases that may be considered suspicious in an email subject.
  • suspicious_tlds.txt: Top-level domains that are either frequently abused, free to register, or otherwise not generally used in the normal course of business or email communication.
  • tranco.csv: Tranco ranking, a research-oriented top sites ranking hardened against manipulation.
  • umbrella_top_1m.csv: Cisco Umbrella top domains based on passive DNS data.
  • umbrella_top_1m_tld.csv: Cisco Umbrella top TLDs based on passive DNS data.
  • url_shorteners.txt: Known URL shorteners that allow anyone to host arbitrary content.

Shoutout to @SwiftOnSecurity for SwiftFilter which inspired many of the suspicious subjects and content.

static-files's People

Contributors

jkamdjou avatar ianthiel avatar morriscode avatar aidenmitchell avatar hugh-sublime avatar rw-access avatar bfilar avatar jtb6 avatar itsojon avatar computermadscientist avatar zoomequipd avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.