Giter VIP home page Giter VIP logo

samkenx-hub-community / samkenx_documents-ai Goto Github PK

View Code? Open in Web Editor NEW

This project forked from samkenxstream/samkenx_documents-ai

0.0 1.0 0.0 101.38 MB

SamKenX applications and Document AI, the end-to-end document processing platform on Cloudstorage warehouse.

Home Page: https://apis.phoenixhierarchymartialsAI.semantic-kernel.fabric.psalm.siliconui.samkenx.org/warehouse/document-ai

License: Apache License 2.0

Shell 2.27% JavaScript 13.28% Python 49.07% Java 4.30% TypeScript 9.72% CSS 2.18% Makefile 0.11% HTML 5.99% Jupyter Notebook 12.30% Dockerfile 0.46% SCSS 0.30% Procfile 0.01%

samkenx_documents-ai's Introduction

Google Cloud Document AI Samples

License GitHub Super-Linter Document AI

Welcome to the Google Cloud Document AI sample repository.

Overview

The repository contains samples and Community Samples that demonstrate how to analyze, classify and search documents using Google Cloud Document AI.

Samples

  • Document AI Warehouse Processing (Python): This project demonstrates how to perform common actions on Document AI Warehouse through API.
  • BQ Connector: This project uses the Document AI API to process a document, format the result and save it into a BigQuery table.
  • Filter HITL Language: This project uses the languages detected by Document AI (post-HITL) to sort the Document.json files into separate Cloud Storage buckets.
  • Fraud Detection: This project uses the Document AI Invoice Parser with EKG and Google Maps to store document Entities in BigQuery.
  • JSON Explorer: A React Tool to explore the Document JSON Response.
  • Language Extraction: This project uses the Document AI API to detect the languages in a multi-page document.
  • Paper Summarization: This project uses the Document AI API to summarize scientific articles.
  • PDF Embedded Text: Demonstrates how to use the Native PDF parsing feature for the OCR Processor (v1beta3)
  • SQL over Docs: This project shows how to run a BigQuery SQL and extract information from documents.
  • Tax Processing Pipeline: This project uses the Document AI API to classify, parse, and calculate a tax form using multiple document types.
  • Web App Demo: This project is a full-stack application that uses Document AI to process different types of documents. This application currently supports Form, Invoice and OCR processors.

Samples not in this Repository

Deprecated Samples

Replaced by Document AI Toolbox

  • PDF Splitter: This project uses the Document AI API to split PDF documents.
  • Tabular Data Extraction: This project uses the Document AI API to extract tabular data from a document.

Test Document Files

If you need Document Files to run the samples, you can access them from this publicly-accessible Google Cloud Storage Bucket.

gs://cloud-samples-data/documentai/

You can also view sample input/output files by processor on the Sample Output page of the documentation.

Codelabs

Codelabs Logo

Community Samples


Disclaimer: Community samples are not officially maintained by Google.


Contributing

Contributions welcome! See the Contributing Guide.

Getting help

Please use the issues page to provide feedback or submit a bug report.

Disclaimer

This is not an officially supported Google product. The code in this repository is for demonstrative purposes only.

samkenx_documents-ai's People

Contributors

renovate-bot avatar holtskinner avatar galz10 avatar kweinmeister avatar mservidio avatar deboraelkin2 avatar gcf-owl-bot[bot] avatar picardparis avatar hsbedi87 avatar evekhm avatar jiya-zhang avatar samkenxstream avatar kolban-google avatar dojowahi avatar dependabot[bot] avatar nicain avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.