Skip to content

Latest commit

 

History

History
323 lines (203 loc) · 19.4 KB

paper_list.md

File metadata and controls

323 lines (203 loc) · 19.4 KB

Survey

  • A survey on sign language literature, Marie Alaghband, Hamid Reza Maghroor, Ivan Garibay, [Paper]

  • A survey on Sign Language machine translation, Adrián Núñez-Marcos, Olatz Perez-de-Viñaspre, Gorka Labaka, [Paper]

Dataset

  • [arXiv:2410.19488] MM-WLAuslan: Multi-View Multi-Modal Word-Level Australian Sign Language Recognition Dataset, Xin Shen, Heming Du, Hongwei Sheng, Shuyun Wang, Hui Chen, Huiqiang Chen, Zhuojie Wu, Xiaobiao Du, Jiaying Ying, Ruihan Lu, Qingzheng Xu, Xin Yu [Paper] [Code]

  • [CVPR 2024] LLMs are Good Sign Language Translators [Paper]

  • [CVPR 2024] Neural Sign Actors: A diffusion model for 3D sign language production from text [Paper] [Project]

  • [CVPR 2024] SignGraph: A Sign Sequence is Worth Graphs of Nodes, Shiwei Gan, Yafeng Yin, Zhiwei Jiang, Hongkai Wen, Lei Xie, Sanglu Lu [Paper] [Code]

  • [Sign Language Recognition Dataset] Enhancing SNN-based spatio-temporal learning: A benchmark dataset and Cross-Modality Attention model, [Paper]

  • [arXiv:2409.01073] SCOPE: Sign Language Contextual Processing with Embedding from LLMs, Yuqi Liu, Wenqian Zhang, Sihan Ren, Chengyu Huang, Jingyi Yu, Lan Xu [Paper]

  • [arXiv:2408.10488] Event Stream based Sign Language Translation: A High-Definition Benchmark Dataset and A New Algorithm, Xiao Wang, Yao Rong, Fuling Wang, Jianing Li, Lin Zhu, Bo Jiang, Yaowei Wang, arXiv 2024, [Paper] [Code]

  • EvCSLR: Event-guided Continuous Sign Language Recognition and Benchmark, Jiang, Yu and Wang, Yuehang and Li, Siqi and Zhang, Yongji and Guo, Qianren and Chu, Qi and Gao, Yue [Code]

  • [arXiv:2407.12593] EvSign: Sign Language Recognition and Translation with Streaming Events, Pengyu Zhang, Hao Yin, Zeren Wang, Wenyue Chen, Shengming Li, Dong Wang, Huchuan Lu, and Xu Jia [Paper] [Code]

  • YouTube-ASL: A Large-Scale, Open-Domain American Sign Language-English Parallel Corpus, Part of Advances in Neural Information Processing Systems 36 (NeurIPS 2023) Datasets and Benchmarks Track [Page]

  • BdSLW60: A Word-Level Bangla Sign Language Dataset, Husne Ara Rubaiyeat, Hasan Mahmud, Ahsan Habib, Md. Kamrul Hasan [Paper]

  • A New Dataset for End-to-End Sign Language Translation: The Greek Elementary School Dataset, [Paper] [Dataset]

  • "Word-level deep sign language recognition from video: A new large-scale dataset and methods comparison.", Li, Dongxu, et al., Proceedings of the IEEE/CVF winter conference on applications of computer vision. 2020. [Paper] [Project Page] [Code]

  • PHOENIX-2014 || Continuous Sign Language Recognition: Towards Large Vocabulary Statistical Recognition Systems Handling Multiple Signers Oscar Koller, Jens Forster, Hermann Ney [Paper] [Dataset], German, 1,081 Sign Classes by 9 Signers. 6,841 Samples

  • [CVPR18]PHOENIX-2014-T || Neural Sign Language Translation Necati Cihan Camgoz1 , Simon Hadfield1 , Oscar Koller2 , Hermann Ney2 , Richard Bowden [Paper] [Code] [Dataset], German, 1,066 Sign Classes by 9 Signers. 8,257 Samples

  • [AAAI18]CSL || Video-Based Sign Language Recognition without Temporal Segmentation J. Jie Huang,1 Wengang Zhou,2 Qilin Zhang,3 Houqiang Li,4 Weiping Li [Paper] [Code] [Dataset], Chinese, 178 Sign Classes by 50 Signers. 25000 Samples

  • [CVPR21]CSL-Daily || Improving Sign Language Translation with Monolingual Data by Sign Back-Translation Hao Zhou1 Wengang Zhou1,2,∗ Weizhen Qi1 Junfu Pu1 Houqiang Li [Paper] [Dataset], Chinese, 2000 Sign Classes by 10 Signers. 20654 Samples

  • WLASL https://dxli94.github.io/WLASL/, American English, 2000 Sign Classes by 119 Signers. 21K Samples

  • NMFs-CSL http://home.ustc.edu.cn/~alexhu/Sources/index.html, Chinese, 1067 Sign Classes by 10 Signers. 32K Samples

  • BOBSL https://www.robots.ox.ac.uk/~vgg/data/bobsl/, British English, 2281 Sign Classes by 39 Signers. 452K Samples

  • EventAHU_DVS346_CSL: https://wpan.ahu.edu.cn/l/GFODFg

Year 2024

  • [arXiv:2412.16524] LLaVA-SLT: Visual Language Tuning for Sign Language Translation, Han Liang, Chengyu Huang, Yuecheng Xu, Cheng Tang, Weicai Ye, Juze Zhang, Xin Chen, Jingyi Yu, Lan Xu [Paper]

  • DiffSLT: Enhancing Diversity in Sign Language Translation via Diffusion Model, JiHwan Moon, Jihoon Park, Jungeun Kim, Jongseong Bae, Hyeongwoo Jeon, Ha Young Kim [Paper] [Code]

  • Leveraging the Power of MLLMs for Gloss-Free Sign Language Translation, Jungeun Kim, Hyeongwoo Jeon, Jongseong Bae, Ha Young Kim [Paper]

  • Discrete to Continuous: Generating Smooth Transition Poses from Sign Language Observation, Shengeng Tang, Jiayi He, Lechao Cheng, Jingjing Wu, Dan Guo, Richang Hong [Paper]

  • SHuBERT: Self-Supervised Sign Language Representation Learning via Multi-Stream Cluster Prediction, Shester Gueuwou, Xiaodan Du, Greg Shakhnarovich, Karen Livescu, Alexander H. Liu [Paper]

  • [arXiv:2411.12901] Signformer is all you need: Towards Edge AI for Sign Language, Eta Yang, [Paper]

  • [arXiv:2410.19586] Diverse Sign Language Translation, Xin Shen, Lei Shen, Shaozu Yuan, Heming Du, Haiyang Sun, Xin Yu [Paper]

  • [arXiv:2408.10488] Event Stream based Sign Language Translation: A High-Definition Benchmark Dataset and A New Algorithm, Xiao Wang, Yao Rong, Fuling Wang, Jianing Li, Lin Zhu, Bo Jiang, Yaowei Wang, arXiv 2024, [Paper] [Code]

  • [TMM2024] EvCSLR: Event-guided Continuous Sign Language Recognition and Benchmark, Jiang, Yu and Wang, Yuehang and Li, Siqi and Zhang, Yongji and Guo, Qianren and Chu, Qi and Gao, Yue [Paper] [Code]

  • [arXiv:2408.07244] Sign language recognition based on deep learning and low-cost handcrafted descriptors, Alvaro Leandro Cavalcante Carneiro, Denis Henrique Pinheiro Salvadeo, Lucas de Brito Silva [Paper]

  • [arXiv:2408.07065] Fingerspelling within Sign Language Translation, Garrett Tanzer [Paper]

  • [arXiv:2408.08544] Scaling up Multimodal Pre-training for Sign Language Understanding, Wengang Zhou, Weichao Zhao, Hezhen Hu, Zecheng Li, Houqiang Li [Paper]

  • [arXiv:2407.14224] Hierarchical Windowed Graph Attention Network and a Large Scale Dataset for Isolated Indian Sign Language Recognition, Suvajit Patra, Arkadip Maitra, Megha Tiwari, K. Kumaran, Swathy Prabhu, Swami Punyeshwarananda, Soumitra Samanta [Paper]

  • [arXiv:2407.09544] A Transformer-Based Multi-Stream Approach for Isolated Iranian Sign Language Recognition, Ali Ghadami, Alireza Taheri, Ali Meghdari [Paper]

  • [arXiv:2407.02854] Universal Gloss-level Representation for Gloss-free Sign Language Translation and Production, Eui Jun Hwang, Sukmin Cho, Huije Lee, Youngwoo Yoon, Jong C. Park [Paper]

  • [arXiv:2406.12369] A Comparative Study of Continuous Sign Language Recognition Techniques, Sarah Alyami, Hamzah Luqman [Paper]

  • [arXiv:2406.07119] T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text, Aoxiong Yin, Haoyuan Li, Kai Shen, Siliang Tang, Yueting Zhuang [Paper] [Project]

  • [arXiv:2406.06907] SignMusketeers: An Efficient Multi-Stream Approach for Sign Language Translation at Scale, Shester Gueuwou, Xiaodan Du, Greg Shakhnarovich, Karen Livescu [Paper]

  • [arXiv:2405.20666] MASA: Motion-aware Masked Autoencoder with Semantic Alignment for Sign Language Recognition, Weichao Zhao, Hezhen Hu, Wengang Zhou, Yunyao Mao, Min Wang, Houqiang Li [Paper]

  • Uncertainty-aware sign language video retrieval with probability distribution modeling, Xuan Wu, Hongxiang Li, Yuanjiang Luo, Xuxin Cheng, Xianwei Zhuang, Meng Cao, Keren Fu [Paper]

  • Lin X, Liu M, Liu K, et al. Spike-SLR: An Energy-efficient Parallel Spiking Transformer for Event-based Sign Language Recognition[J]. 2024. [Paper]

  • [arXiv:2405.14312] Improving Gloss-free Sign Language Translation by Reducing Representation Density, Jinhui Ye, Xing Wang, Wenxiang Jiao, Junwei Liang, Hui Xiong [Paper] [Code]

  • [arXiv:2405.12018] Continuous Sign Language Recognition with Adapted Conformer via Unsupervised Pretraining, Neena Aloysius, Geetha M, Prema Nedungadi [Paper]

  • [arXiv:2405.10423] Diversity-Aware Sign Language Production through a Pose Encoding Variational Autoencoder, Mohamed Ilyes Lakhal, Richard Bowden [Paper]

  • [arXiv:2405.10718] SignLLM: Sign Languages Production Large Language Models, Sen Fang, Lei Wang, Ce Zheng, Yapeng Tian, Chen Chen [Paper] [Project]

  • [arXiv:2405.07663] Sign Stitching: A Novel Approach to Sign Language Production, Harry Walsh, Ben Saunders, Richard Bowden [Paper]

  • [arXiv:2405.05672] Multi-Stream Keypoint Attention Network for Sign Language Recognition and Translation, Mo Guan, Yan Wang, Guangkun Ma, Jiarui Liu, Mingzu Sun [Paper] [Code]

  • [ICLR 2024] Sign2GPT: Leveraging Large Language Models for Gloss-Free Sign Language Translation, arXiv:2405.04164, Ryan Wong, Necati Cihan Camgoz, Richard Bowden [Paper]

  • A Hong Kong Sign Language Corpus Collected from Sign-interpreted TV News, LREC-COLING 2024, Zhe Niu, Ronglai Zuo, Brian Mak, Fangyun Wei [Paper] [Homepage] [Github]

  • Enhancing Brazilian Sign Language Recognition through Skeleton Image Representation, arXiv:2404.19148, Carlos Eduardo G. R. Alves, Francisco de Assis Boldt, Thiago M. Paixão [Paper]

  • CorrNet+: Sign Language Recognition and Translation via Spatial-Temporal Correlation, arXiv:2404.11111 Lianyu Hu, Wei Feng, Liqing Gao, Zekang Liu, Liang Wan [Paper] [Code]

  • Dynamic Spatial-Temporal Aggregation for Skeleton-Aware Sign Language Recognition Lianyu Hu, Liqing Gao, Zekang Liu, Wei Feng, [Paper] [Code]

  • Transfer Learning for Cross-dataset Isolated Sign Language Recognition in Under-Resourced Datasets Ahmet Alp Kindiroglu, Ozgur Kara, Ogulcan Ozdemir and Lale Akarun, [Paper] [Code]

  • Improving Continuous Sign Language Recognition with Adapted Image Models, Lianyu Hu, Tongkai Shi, Liqing Gao, Zekang Liu, Wei Feng, arXiv:2404.08226 [Paper] [Code]

  • "StepNet: Spatial-temporal Part-aware Network for Isolated Sign Language Recognition." Shen, Xiaolong, Zhedong Zheng, and Yi Yang. ACM Transactions on Multimedia Computing, Communications and Applications (2024). [Paper]

  • Using an LLM to Turn Sign Spottings into Spoken Language Sentences, Ozge Mercanoglu Sincan, Necati Cihan Camgoz, Richard Bowden [Paper]

  • TCNet: Continuous Sign Language Recognition from Trajectories and Correlated Regions, Hui Lu, Albert Ali Salah, Ronald Poppe [Paper]

  • [AAAI-2024] Conditional Variational Autoencoder for Sign Language Translation with Cross-Modal Alignment, Rui Zhao, Liang Zhang, Biao Fu, Cong Hu, Jinsong Su, Yidong Chen [Paper] [Code]

  • Systemic Biases in Sign Language AI Research: A Deaf-Led Call to Reevaluate Research Agendas, Aashaka Desai, Maartje De Meulder, Julie A. Hochgesang, Annemarie Kocab, Alex X. Lu [Paper]

  • Continuous Sign Language Recognition Based on Motor attention mechanism and frame-level Self-distillation, Qidan Zhu, Jing Li, Fei Yuan, Quan Gan [Paper]

  • Radar-Based Recognition of Static Hand Gestures in American Sign Language, Christian Schuessler, Wenxuan Zhang, Johanna Bräunig, Marcel Hoffmann, Michael Stelzig, Martin Vossiek [Paper]

  • SignVTCL: Multi-Modal Continuous Sign Language Recognition Enhanced by Visual-Textual Contrastive Learning, Hao Chen, Jiaze Wang, Ziyu Guo, Jinpeng Li, Donghao Zhou, Bian Wu, Chenyong Guan, Guangyong Chen, Pheng-Ann Heng [Paper] [Code]

  • A Simple Baseline for Spoken Language to Sign Language Translation with 3D Avatars, Ronglai Zuo1* Fangyun Wei2* † Zenggui Chen2 Brian Mak1 Jiaolong Yang2 Xin Tong2 [Paper] [Code]

  • Towards Online Sign Language Recognition and Translation, Ronglai Zuo1 Fangyun Wei2† Brian Mak1 [Paper] [Code]

Year 2023

  • [RANLP 2023] Li, Jacky, et al. "Sign Language Recognition and Translation: A Multi-Modal Approach using Computer Vision and Natural Language Processing." Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing. 2023. [Paper]

  • [ICCV 2023] Human Part-wise 3D Motion Context Learning for Sign Language Recognition Taeryung Lee, Yeonguk Oh, Kyoung Mu Lee, [Paper]

  • A Sign Language Recognition System with Pepper, Lightweight-Transformer, and LLM, JongYoon Lim, Inkyu Sa, Bruce MacDonald, Ho Seok Ahn [Paper]

  • Linguistically Motivated Sign Language Segmentation. Moryossef, A., Jiang, Z., Müller, M., Ebling, S., & Goldberg, Y. (2023). arXiv preprint arXiv:2310.13960. [Paper]

  • Sign Language Production with Latent Motion Transformer, Pan Xie Taiying Peng* Yao Du Qipeng Zhang, [Paper]

  • StepNet: Spatial-temporal Part-aware Network for Sign Language Recognition, Xiaolong Shen, Zhedong Zheng and Yi Yang, [Paper]

  • "Transferring cross-domain knowledge for video sign language recognition." Li, Dongxu, et al. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020. [Paper]

  • "Improving Continuous Sign Language Recognition with Consistency Constraints and Signer Removal." Zuo, Ronglai, and Brian Mak. arXiv preprint arXiv:2212.13023 (2022). [Paper]

  • Read and Attend: Temporal Localisation in Sign Language Videos Gul Varol Liliane Momeni Samuel Albanie Triantafyllos Afouras Andrew Zisserman, CVPR_2021 [Paper]

  • "Fully convolutional networks for continuous sign language recognition." Cheng, Ka Leong, et al. European Conference on Computer Vision. Springer, Cham, 2020. [Paper]

Year 2022

  • Jointly Harnessing Prior Structures and Temporal Consistency for Sign Language Video Generation. Yucheng Suo, Zhedong Zheng, Xiaohan Wang, Bang Zhang and Yi Yang, arXiv 2022 [Paper]

  • StepNet: Spatial-temporal Part-aware Network for Sign Language Recognition. arXiv 2022

Year 2021

  • Signbert: Pre-training of hand-model-aware representation for sign language recognition. ICCV 2021

  • Global-local enhancement network for nmf-aware sign language recognition. ACM TOMM 2021

Year 2020 and Before

  • A deep neural framework for continuous sign language recognition by iterative training. TMM 2019

  • Spatial temporal graph convolutional networks for skeleton-based action recognition. AAAI 2018