linto-ai / linto-studio Goto Github PK

Transcription and annotation interface for recorded audio or video files

License: GNU Affero General Public License v3.0

JavaScript 46.15% Dockerfile 0.11% Shell 0.81% Kotlin 0.01% Swift 0.12% Objective-C 0.01% Dart 11.51% CMake 1.12% C++ 1.39% C 0.08% HTML 0.64% CSS 0.88% SCSS 9.63% Vue 27.55%

asr audio-transcription stt video-transcription transcription-edition caption captioning-videos subtitle subtitles virtual-scribe

linto-studio's Introduction

LinTO Studio

Open Source AI driven recording, transcription and media management solution
Demo • Get the android application

Features

Media management platform
Advanced transcription features (speaker identification, ...)
Auto timestamp alignement
NLP and AI features
Closed caption edition

With a companion app

Record on the go
Synchronize your media with LinTO studio

Install and use

docker-compose up -d

then open your web browser to http://localhost:8003

Configuration

SMTP configuration

You can configure an smtp server, for improving sharing and account verification.

SMTP_HOST=mail.example.com
SMTP_PORT=465
SMTP_SECURE=true
SMTP_REQUIRE_TLS=true
SMTP_AUTH=username
SMTP_PSWD=password
[email protected]

Transcription service

By default LinTO Studio uses the LinTO API Gateway to do transcription. To use Studio with another gateway, set environnement variables

GATEWAY_SERVICES="API Gateway service endpoint"

linto-studio's People

Contributors

Stargazers

Watchers

linto-studio's Issues

Conversation Manager public α availability https://convos.linto.ai

convos.linto.ai with wide open Cognitive APIs
Quite buggy

Conversation Manager public beta

Deployment of Conversation Manager in public beta, packing up developed features

Public availability for front-end an websocket server source code and Docker Images
developers would run Docker images on premise an use localy running Cognitive APIs or public LinTO cognitive APIs (SaaS)
Relevant documentation
Product landing page

zero organization leads to blank interface

Describe the bug
The website shows blank interface (empty page) once the user account is associated to 0 organization.

To Reproduce
Steps to reproduce the behavior:

Go to "create an account", at this step, a personal organization will also be created.
Login to the main interface: https://alpha.linto.ai/interface/conversations
Go to "organization setting", and delete the created organization (there should be only 1 organization).
See error, empty webpage with zero UI components.

Expected behavior
I guess the conversation-manager assumes the user should have at least 1 organization.
So the "delete organization" should not be allowed if this is the case.

Propose menu for Speech processing > keyword auto-highlight

Introduce feature menu "Speech Processing"
Use relevant NLP API for Keyword identification
Associate them to a highlight with a "keyword" type

Transcription disparue

Describe the bug
La transcription de la conversation a disparue lors de la migration (page blanche)

https://convos.linto.ai/interface/conversations/63c18e3373518f00190ac634/transcription

To Reproduce
Steps to reproduce the behavior:

Go to '...'
Click on '....'
Scroll down to '....'
See error

Expected behavior
A clear and concise description of what you expected to happen.

Screenshots
If applicable, add screenshots to help explain your problem.

Additional context
Add any other context about the problem here.

Selecting English as language on Studo.linto.app does not list any services , only french does.

Suggestion: Undo last modifications

When editing a conversation, I show no way to undo the last modifications.
This could be useful.

Conversation Manager fails on file with no audio

Bug description
On a small audio file of 3 sec with no audio content, the conversation fails with "Erreur : une erreur s'est déclenchée pendant le traitement de l'audio"

To Reproduce
Steps to reproduce the behavior:

Go to https://convos.linto.ai
Click on "Créer une conversation"
Choose a wav file with no audio in it. Like this one empty.zip
Do not choose any particular service (no punctuation, no speaker identification)
Click on "Créer la conversation"
Wait a bit and click on the conversation

Expected behavior
It should either appear as an empty conversation (where you can just play the audio file).
Or the failure message should be more explicit.

Screenshots

Suggestion: rename "Identification des locuteurs" / "Speaker identification"

"Identification des locuteurs" or "speaker identification" refers to the process of recovering the identity of the speakers (assuming one can link the identity of a speaker to a set of audio recordings / to a model).
It's different from the diarization task.

I suggest to rather use "Segmentation en locuteurs" / "Speaker segmentation".

linto-ai / linto-studio Goto Github PK

linto-studio's Introduction

LinTO Studio

Open Source AI driven recording, transcription and media management solution Demo • Get the android application

Features

With a companion app

Install and use

Configuration

SMTP configuration

Transcription service

linto-studio's People

Contributors

Stargazers

Watchers

linto-studio's Issues

Recommend Projects

Recommend Topics

Recommend Org

Open Source AI driven recording, transcription and media management solution
Demo • Get the android application