We generate segment-wise embeddings zt∈Z that can represent a unit segment of audio fr

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

Dimension of Zt about neural-audio-fp HOT 7 CLOSED

mimbres commented on May 30, 2024

Dimension of Zt

from neural-audio-fp.

Comments (7)

mimbres commented on May 30, 2024

@kasireddygariDineshKumarReddy z(t) is of dimension d. In config file, EMB_SZ defines d.

neural-audio-fp/config/default.yaml

Line 47 in 058d812

EMB_SZ : 128 # Dimension of fingerprint, d in this paper.

from neural-audio-fp.

kasireddygariDineshKumarReddy commented on May 30, 2024

Do you mean each unit segment(lets say 1second of audio) is of dimension 128 or d

from neural-audio-fp.

mimbres commented on May 30, 2024

Yes d=128.

from neural-audio-fp.

kasireddygariDineshKumarReddy commented on May 30, 2024

In NFP algorithm ,it was given that
Zk^(org) = g ◦ f (Sk)
Zk^( rep) = g ◦ f (M(Sk ))
and after loop completion
Z= {Z1^(org) , Z1^(rep) , ..., Z N/2^(org), Z N/2^(rep)}
Is Zk^(org) ,Zk^(rep) of 128 dimension or else Z which is combination of all these originals and replicas is of dimension 128?

from neural-audio-fp.

mimbres commented on May 30, 2024

Z^k(*) is kth single element in training batch, and it has a shape (128,).
Z will have a shape (B, 128) where B is training batch size.

from neural-audio-fp.

kasireddygariDineshKumarReddy commented on May 30, 2024

Is Agumentation performed before feature extraction or after log mel spectrogram feature extraction?

from neural-audio-fp.

mimbres commented on May 30, 2024

Most of the augmentations, such as mixing background noise, applying IR filters, and mixing speech (not covered in the paper) are processed in time-domain. In spectral domain, see more details: https://github.com/mimbres/neural-audio-fp/tree/main/model/fp/specaug_chain

from neural-audio-fp.

Recommend Projects

Dimension of Zt about neural-audio-fp HOT 7 CLOSED

Comments (7)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent