Giter VIP home page Giter VIP logo

linto-ai / linto-studio Goto Github PK

View Code? Open in Web Editor NEW
18.0 4.0 0.0 63.38 MB

Transcription and annotation interface for recorded audio or video files

License: GNU Affero General Public License v3.0

JavaScript 46.15% Dockerfile 0.11% Shell 0.81% Kotlin 0.01% Swift 0.12% Objective-C 0.01% Dart 11.51% CMake 1.12% C++ 1.39% C 0.08% HTML 0.64% CSS 0.88% SCSS 9.63% Vue 27.55%
asr audio-transcription stt video-transcription transcription-edition caption captioning-videos subtitle subtitles virtual-scribe

linto-studio's Introduction

LinTO Studio

Open Source AI driven recording, transcription and media management solution
DemoGet the android application

screenshot of linto.app

Features

  • Media management platform
  • Advanced transcription features (speaker identification, ...)
  • Auto timestamp alignement
  • NLP and AI features
  • Closed caption edition

With a companion app

drawing
  • Record on the go
  • Synchronize your media with LinTO studio

Install and use

docker-compose up -d

then open your web browser to http://localhost:8003

Configuration

SMTP configuration

You can configure an smtp server, for improving sharing and account verification.

SMTP_HOST=mail.example.com
SMTP_PORT=465
SMTP_SECURE=true
SMTP_REQUIRE_TLS=true
SMTP_AUTH=username
SMTP_PSWD=password
[email protected]

Transcription service

By default LinTO Studio uses the LinTO API Gateway to do transcription. To use Studio with another gateway, set environnement variables

GATEWAY_SERVICES="API Gateway service endpoint"

linto-studio's People

Contributors

capmartinleo avatar damienlaine avatar houpert avatar jslinagora avatar kqtethompson avatar linagora-linto avatar rlopezdev avatar tjiho avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

linto-studio's Issues

Conversation Manager public beta

Deployment of Conversation Manager in public beta, packing up developed features

  • Public availability for front-end an websocket server source code and Docker Images
  • developers would run Docker images on premise an use localy running Cognitive APIs or public LinTO cognitive APIs (SaaS)
  • Relevant documentation
  • Product landing page

zero organization leads to blank interface

Describe the bug
The website shows blank interface (empty page) once the user account is associated to 0 organization.

To Reproduce
Steps to reproduce the behavior:

  1. Go to "create an account", at this step, a personal organization will also be created.
  2. Login to the main interface: https://alpha.linto.ai/interface/conversations
  3. Go to "organization setting", and delete the created organization (there should be only 1 organization).
  4. See error, empty webpage with zero UI components.

Expected behavior
I guess the conversation-manager assumes the user should have at least 1 organization.
So the "delete organization" should not be allowed if this is the case.

Transcription disparue

Describe the bug
La transcription de la conversation a disparue lors de la migration (page blanche)

https://convos.linto.ai/interface/conversations/63c18e3373518f00190ac634/transcription

To Reproduce
Steps to reproduce the behavior:

  1. Go to '...'
  2. Click on '....'
  3. Scroll down to '....'
  4. See error

Expected behavior
A clear and concise description of what you expected to happen.

Screenshots
If applicable, add screenshots to help explain your problem.

Additional context
Add any other context about the problem here.

Conversation Manager fails on file with no audio

Bug description
On a small audio file of 3 sec with no audio content, the conversation fails with "Erreur : une erreur s'est déclenchée pendant le traitement de l'audio"

To Reproduce
Steps to reproduce the behavior:

  1. Go to https://convos.linto.ai
  2. Click on "Créer une conversation"
  3. Choose a wav file with no audio in it. Like this one empty.zip
  4. Do not choose any particular service (no punctuation, no speaker identification)
  5. Click on "Créer la conversation"
  6. Wait a bit and click on the conversation

Expected behavior
It should either appear as an empty conversation (where you can just play the audio file).
Or the failure message should be more explicit.

Screenshots
Screenshot from 2023-01-12 17-57-32

Suggestion: rename "Identification des locuteurs" / "Speaker identification"

"Identification des locuteurs" or "speaker identification" refers to the process of recovering the identity of the speakers (assuming one can link the identity of a speaker to a set of audio recordings / to a model).
It's different from the diarization task.

I suggest to rather use "Segmentation en locuteurs" / "Speaker segmentation".

image

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.