
localplans-analysis

Data analysis for local plans

Experimenting with applying AI to Local Plans produced by planning authorities in England.

This code analyses Local Plan PDFs and adds data to a vector store.

The repo also contains some noodling on ideas.

See https://github.com/jimmytidey/localplans-server for a front end.

Setting up the Postgres DB

-- !!!! Installing the vector extension took nearly two hours.
-- !!!! It would probably be best to run it from Heroku dataclips
CREATE EXTENSION IF NOT EXISTS vector;

CREATE TABLE IF NOT EXISTS text_fragments (
    id SERIAL PRIMARY KEY,
    metadata JSONB,
    vector VECTOR(384),
    plain_text TEXT,
    full_text tsvector,
    hash TEXT,
    filename TEXT
);

-- Create a text search configuration for English.
-- PostgreSQL does not accept IF NOT EXISTS here, so this statement fails if the configuration already exists.
CREATE TEXT SEARCH CONFIGURATION english_simple_config ( COPY = simple );

ALTER TEXT SEARCH CONFIGURATION english_simple_config
    ALTER MAPPING FOR asciiword, asciihword, hword_asciipart, word, hword, hword_part
    WITH english_stem;

-- Populate the full_text column from plain_text using the English full-text search configuration
ALTER TABLE text_fragments
    ALTER COLUMN full_text SET DATA TYPE tsvector
    USING to_tsvector('english_simple_config', plain_text);

-- Create an index on the tsvector column for faster full-text searches
CREATE INDEX IF NOT EXISTS text_tsvector_index ON text_fragments USING gin(full_text);
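
As a rough illustration of how the table might be used, here is a minimal Python sketch that prepares parameters for inserting a fragment and for a combined full-text and vector search. It is not taken from this repo. The helper names (`vector_literal`, `insert_params`, `search_params`), the use of SHA-256 for the `hash` column, the choice of pgvector's cosine distance operator `<=>`, and the psycopg-style `%s` placeholders are all assumptions.

```python
import hashlib
import json

VECTOR_DIM = 384  # matches VECTOR(384) in text_fragments

# Assumed insert statement: full_text is derived from the same plain text.
INSERT_SQL = """
INSERT INTO text_fragments (metadata, vector, plain_text, full_text, hash, filename)
VALUES (%s::jsonb, %s::vector, %s, to_tsvector('english_simple_config', %s), %s, %s)
"""

# Assumed search: filter by full-text match, then rank by cosine distance.
SEARCH_SQL = """
SELECT id, filename, plain_text, vector <=> %s::vector AS distance
FROM text_fragments
WHERE full_text @@ plainto_tsquery('english_simple_config', %s)
ORDER BY distance
LIMIT %s
"""


def vector_literal(embedding):
    """Format a list of floats in pgvector's text form, e.g. '[0.1,0.2]'."""
    if len(embedding) != VECTOR_DIM:
        raise ValueError(f"expected {VECTOR_DIM} dimensions, got {len(embedding)}")
    return "[" + ",".join(repr(float(x)) for x in embedding) + "]"


def insert_params(plain_text, embedding, filename, metadata):
    """Build the parameter tuple for INSERT_SQL (hypothetical helper)."""
    # Assumption: the hash column holds a SHA-256 digest of the fragment text.
    digest = hashlib.sha256(plain_text.encode("utf-8")).hexdigest()
    return (
        json.dumps(metadata),
        vector_literal(embedding),
        plain_text,
        plain_text,  # second copy feeds to_tsvector(...)
        digest,
        filename,
    )


def search_params(query_text, query_embedding, limit=10):
    """Build the parameter tuple for SEARCH_SQL (hypothetical helper)."""
    return (vector_literal(query_embedding), query_text, limit)
```

With a driver such as psycopg, these would be passed as `cur.execute(INSERT_SQL, insert_params(...))`. Building the parameters separately from the connection keeps the sketch runnable without a database.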

