"topic:icassp" — Search

44 results for “topic:icassp”

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

deep-learningicasspinterspeechpytorchquality-of-experiencespeech-qualityspeech-synthesistext-to-speechttsvoice-conversion

DmitryRyumin/ICASSP-2023-24-Papers

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

Python51923Updated 17 hours ago

asrdenoisingdomain-adaptationface-recognitiongenerative-modelsicasspicassp2023icassp2024image-generationkeyword-spottinglanguage-modelingmultimodal-learningmusic-generationself-supervised-learningsemantic-segmentationsignal-processingsignal-restorationspeech-recognitionspoken-language-understandingvad

michaelzhang-ai/Text2Video

ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with phonetic dictionary".

Python44094Updated 1 day ago

aigcavatardeep-learningdigital-humanitiesgangenerative-aiicasspmetaversespeech-synthesistalkingtalking-face-generationtalking-headtalking-headstext-to-videottsvid2vidvideovirtual-humans

IBM/TabFormer

Code & Data for "Tabular Transformers for Modeling Multivariate Time Series" (ICASSP, 2021)

Python35890Updated 5 days ago

artificial-intelligencebertcredit-card-datasetcredit-card-transactionfraud-detectiongpthuggingfaceicasspicassp2021machine-learningprsa-datasetpytorchtabular-datatransformer

soham97/awesome-sound_event_detection

Reading list for research topics in Sound AI

1968Updated 3 weeks ago

acoustic-scene-classificationaudio-captioningaudio-generationaudio-processingaudio-retrievalicasspinterspeechrepresentation-learningsound-event-detectionzero-shot-learning

Jiaxin-Ye/TIM-Net_SER

[ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech Emotion Recognition".

Python18826Updated 1 day ago

bi-directionalcasiaemodbemotion-recognitionemovoicasspiemocapravdesssaveespeech-emotion-recognition

glam-imperial/EmotionalConversionStarGAN

This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data Augmentation of End-to-End Emotion Recognition".

Python13727Updated 2 months ago

augsburg-universitydata-augmentationdeep-learningdeep-neural-networksemotion-recognitiongenerative-adversarial-networkicasspicassp-2020imperial-college-londonimperial-glamspeech-synthesisstarganstargan-vc

DmitryRyumin/NewEraAI-Papers

The repository provides links to collections of influential and interesting research papers from top AI conferences, with open-source code to promote reproducibility and provide detailed implementation insights beyond the scope of the article. Stay up to date with the latest advances in AI research!

Python1194Updated 1 week ago

artificial-intelligencecomputer-visioncvprdeep-learningemnlpicasspiccvimage-processinginterspeechismirmashine-learningnatural-language-processingneural-networkssignal-processingtext-classificationvideo-processing

khanld/chunkformer

ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription

Python7821Updated just now

asrchunkformerdialectemotiongenderhuggingfaceicasspicassp2025regionspeech-recognitionspeech-to-textsttvietnamesevietnamese-speech-recognition

XuesongYang/end2end_dialog

ICASSP2017: End-to-end joint learning of natural language understanding and dialogue manager

Python7525Updated 7 months ago

icassp

koudounasalkis/voc2vec

This repository contains the code for the paper "voc2vec: A Foundation Model for Non-Verbal Vocalization", accepted at ICASSP 2025.

Python471Updated 1 month ago

audio-classificationfoundation-modelsicasspnon-verbal-vocalisationopen-sourcepre-training

6lyc/CDNMF

[ICASSP 2024] Official implementation of our paper "Contrastive Deep Nonnegative Matrix Factorization for Community Detection"

Python428Updated 1 month ago

community-detectioncontrastive-learninggraph-clusteringicasspmachine-learningmatrix-factorizationpythonunsupervised-learning

fonfonx/FaceRecognition

Face Recognition in real-world images [ICASSP 2017]

Python3821Updated 2 years ago

face-recognitionicassplandmarkslfwopencvpythonreal-world-imagesrscsparse-coding

doheejin/HiPAMA

This repository is the implementation of the HiPAMA architecture, introduced in the paper, Hierarchical Pronunciation Assessment with Multi-Aspect Attention (ICASSP 2023).

Python382Updated 1 month ago

apaassessmentautomatic-pronunciation-assessmentcapticasspicassp2023language-learningnlppronunciationpronunciation-scoringspeech-processing

30stomercury/Interaction-Aware-Attention-Network

[ICASSP19] An Interaction-aware Attention Network for Speech Emotion Recognition in Spoken Dialogs

Python359Updated 9 months ago

emotion-recognitionicasspicassp-2019speech-emotion-recognitiontensorflow

monetjoe/latex_paper_templates

This repository provides LaTeX templates for academic papers, you can select the appropriate template for your target conference or journal by switching branches. Each branch corresponds to a specific publication venue and follows its official formatting requirements.|本项目提供多种学术论文的 LaTeX 模板，可通过切换分支选择对应的会议或期刊模板。每个分支均针对特定投稿场景设计，并遵循相应的官方排版规范。

TeX34119Updated 3 days ago

icasspicmeieeeismirlatexlatex-template

eleGAN23/QVAE

Official PyTorch implementation of A Quaternion-Valued Variational Autoencoder (QVAE).

Python314Updated 2 months ago

entropyentropy-measuresgenerative-modelsicassppytorch-vaequaternion-domainquaternion-neural-networksquaternionsvaevae-pytorchvaesvariational-autoencodervariational-autoencoders

Neclow/SERABArchived

SERAB: a multi-lingual benchmark for speech emotion recognition

Python285Updated 1 year ago

benchmarkbyolbyol-adeep-learningemotion-recognitionicassppytorchscikit-learnself-supervised-learningspeech-processing

KrishnaswamyLab/ImageFlowNet

[ICASSP 2025 Oral] ImageFlowNet: Forecasting Multiscale Image-Level Trajectories of Disease Progression with Irregularly-Sampled Longitudinal Medical Images

Python243Updated 6 days ago

disease-progressionicasspicassp2025image-forecastingimage-predictionmedical-image-analysisneural-odepytorchspatial-temporaltime-seriestime-series-forecastingtrajectory-predictionunet

orbxball/icassp2019-latex-template

ICASSP 2019 official Latex template

TeX2313Updated 5 months ago

acousticsconferenceicasspicassp-2019ieeelatexsignal-processingspeech

kjw11/Speaker-Aware-CTC

Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.

Python211Updated 6 months ago

asrctcicasspsactc

choyingw/SCADC-DepthCompletion

ICASSP 2021: Scene Completeness-Aware Lidar Depth Completion for Driving Scenario

Python191Updated 3 months ago

3dautonomous-drivingautonomous-vehiclescomputer-visiondeep-neural-networksdepth-completiondepth-estimationicasspicassp2021lidarscene-reconstructionstereo-vision

huangyz0918/kws-continual-learning

Continual Learning Benchmark for Spoken Keyword Spotting

Python171Updated 1 month ago

deep-learningicasspkeyword-extractionkeyword-spotting

seorim0/ResUNet-LC

2D residual U-Net (ResUNet) and a lead combiner (LC) for 12-lead ECG Abnormality Classification

Python151Updated 2 days ago

abnormal-detectionabnormality-detectionclassificationdeep-learningdeep-neural-networksdnnecgecg-classificationelectrocardiogramicasspicassp2024multi-label-classificationpytorchresnet

yousefkotp/Flare-Free-Vision-Empowering-Uformer-with-Depth-Insights

The official implementation for IEEE-ICASSP 2024 paper "Flare-Free Vision: Empowering Uformer with Depth Insights"

Python151Updated 3 months ago

deep-learningdepth-estimationdepth-mapflare-freeflare-removalicasspicassp2024ieee-icasspimage-enhancementimage-enhancingimage-processingimage-restorationneural-networksu-shaped-transformer

ChenLiu-1996/ImageFlowNet

[ICASSP 2025 Oral] ImageFlowNet: Forecasting Multiscale Image-Level Trajectories of Disease Progression with Irregularly-Sampled Longitudinal Medical Images

Python141Updated 3 weeks ago

differential-equationsdisease-progressionicasspicassp2025image-forecastingimage-predictionlatent-spacemedical-image-analysismedical-imagingneural-odepytorchspatial-temporaltime-series-forecastingtrajectory-predictionunet

meowoodie/Regularized-RBM

A regularized version of RBM for unsupervised feature selection.

Python132Updated 3 years ago

data-miningembeddingseventsfeature-selectionicasspicassp-2018l1machine-learningpythonrbmregularizationskipgramstatisticstensorflowunsupervised-learningvariable-selection

kwatcharasupat/directional-sparse-filtering-tf

Python Implementation for Directional Sparse Filtering with Tensorflow/Keras

Python91Updated 5 months ago

blind-source-separationdspicasspicassp-2017icassp-2021signal-processingspeech-separationunsupervised-learning

SMIL-SPCRAS/DAVIS

Official repo for "Audio-Visual Speech Recognition In-the-Wild: Multi-Angle Vehicle Cabin Corpus and Attention-based Method" in ICASSP 2024

JavaScript90Updated 1 year ago

attention-mechanismaudio-visualavsrcorpusicasspicassp2024in-the-wildmulti-modalsignal-processingspatio-temporal-featuresspeech-recognition

Factral/PrivDL

code for the paper: PRIVACY-PRESERVING DEEP LEARNING: LEVERAGING DEFORMABLE OPERATORS FOR SECURE TASK LEARNING

Python70Updated 9 months ago

deep-learningicasspprivacyprivacy-deep-learningprivacy-preservingprivacy-preserving-deep-learningshuffle

Page 1 of 2