10.21227/4CCA-RR50
Yuntao Wang
Yuntao
Wang
https://orcid.org/0000-0001-9395-2948
Kun Yang
Kun
Yang
Yunzhao Yang
Yunzhao
Yang
Zhenyu Zhang
Zhenyu
Zhang
Xiaowei Yi
Xiaowei
Yi
Xianfeng Zhao
Xianfeng
Zhao
State Key Laboratory of Information Security, Institute of Information Engineering Chinese Academy of Sciences and School of Cyber Security, University of Chinese Academy of Sciences
Audio Steganalysis Dataset
IEEE DataPort
2019
Signal Processing
Security
Steganalysis
Steganography
Audio
Mp3
WAV
2019-05-12
Dataset
Creative Commons Attribution
The steganography and steganalysis of audio, especially compressed audio, have drawn increasing attention in recent years, and various algorithms are proposed. However, there is no standard public dataset for us to verify the efficiency of each proposed algorithm. Therefore, to promote the study field, we construct a dataset including 33038 stereo WAV audio clips with a sampling rate of 44.1 kHz and duration of 10s. And, all audio files are from the Internet through data crawling, which is for a better simulation of a real detection environment. The dataset is used for MP3 steganalysis at this stage. We provide corresponding MP3 encoder, LAME, and steganographic encoder, HCM, EECS and so on, which is developed based on LAME. What's more, some useful python scripts are supplied for samples make in batch. The dataset is still expanding, and we will include AAC, AMR and other audio formats in the future.Keywords: Audio, MP3, Steganalysis, Steganography