Publication date: Available online 5 September 2016
Source:Computer Speech & Language
Author(s): Ali Khodabakhsh, Amir Mohammadi, Cenk Demiroglu
State-of-the-art speaker verification systems are vulnerable to spoofing attacks using speech synthesis. To solve the issue, high-performance synthetic speech detectors (SSDs) for known attack methods have been proposed recently. Here, as opposed to developing new detectors, we investigate new attack strategies. Investigating new techniques that are specifically tailored for spoofing attacks that can spoof the voice verification system and are difficult to detect is expected to increase the security of voice verification systems by enabling the development of better detectors. First, we investigated the vulnerability of an i-vector based verification system to attacks using statistical speech synthesis (SSS) with a particular focus on the case where the attacker has only a very limited amount of data from the target speaker. Even with a single adaptation utterance, the false alarm rate was found to be %23. Still, SSS-generated speech is easy to detect [1, 2] which dramatically reduces its effectiveness. For more effective attacks with limited data, we propose a hybrid statistical/concatenative synthesis approach and show that hybrid synthesis significantly increases the false alarm rate in the verification system compared to the baseline SSS method. Moreover, proposed hybrid synthesis makes detecting synthetic speech more difficult compared to SSS even when very limited amount of original speech recordings are available to the attacker. To further increase the effectiveness of the attacks, we propose a linear regression method that transforms synthetic features into more natural features. Even though the regression approach is more effective at spoofing the detectors, it is not as effective as the hybrid synthesis approach in spoofing the verification system. An interpolation approach is proposed to combine the linear regression and hybrid synthesis methods which is shown to provide the best spoofing performance in most cases.
from #MedicinebyAlexandrosSfakianakis via xlomafota13 on Inoreader http://ift.tt/2chkqBg
via IFTTT
Αρχειοθήκη ιστολογίου
-
►
2020
(289)
- ► Φεβρουαρίου (28)
-
►
2019
(9071)
- ► Δεκεμβρίου (19)
- ► Σεπτεμβρίου (54)
- ► Φεβρουαρίου (3642)
- ► Ιανουαρίου (3200)
-
►
2018
(39872)
- ► Δεκεμβρίου (3318)
- ► Σεπτεμβρίου (3683)
- ► Φεβρουαρίου (2693)
- ► Ιανουαρίου (3198)
-
►
2017
(41099)
- ► Δεκεμβρίου (3127)
- ► Σεπτεμβρίου (2173)
-
▼
2016
(13807)
- ► Δεκεμβρίου (700)
-
▼
Σεπτεμβρίου
(600)
-
▼
Σεπ 05
(50)
- ACACES 2016 poster abstracts
- The interpersonal function of pain: conserving mul...
- Dyscalculie, wat was dat ook weer?
- Heat treatment, microstructure and properties of 7...
- A novel method for the measurement of accurate deg...
- Optimizing practical orienteering problems with st...
- Evaluation of acute Ni bioavailability models for ...
- A survey of substance use for cognitive enhancemen...
- Therapeutic and prophylactic uses of invertebrates...
- Vitamin D receptor gene associations with pulmonar...
- Traditional knowledge and use of wild mushrooms by...
- IJMS, Vol. 17, Pages 1479: Roles of Voltage-Gated ...
- Verbessertes Überleben bei Patienten mit primär me...
- Prä- oder postoperative Strahlentherapie bei retro...
- Quality control of involved field radiotherapy in ...
- Therapie des lokalisierten nodulären Lymphozyten-p...
- Enhanced XOR activity in eNOS-deficient mice: Effe...
- EDITORIAL
- Berrylin June “BJ” Ferguson, MD, Associate Editor
- Masthead - Editorial Board And Table of Contents
- On journals and narrative mediality: the paratextu...
- Correlative microscopy of a carbide-free bainitic ...
- Chain transfer in degenerative RAFT polymerization...
- DFT-based microkinetic modeling of ethanol dehydra...
- The strength of multi-scale modeling to unveil the...
- Adaptation and standardization of a Western tool f...
- A light activated reaction manifold
- First principles kinetic study on the effect of ze...
- Entre verbe et adverbe: grammaticalisation et dégr...
- Differences in environmental preferences towards c...
- A corpus-driven account of the noun classes and ge...
- Understanding the reactivity of unsaturated alcoho...
- Mitral regurgitation as a phenotypic manifestation...
- A single-event microKinetic model for the cobalt c...
- Ni-EXTEND: extending the chronic nickel BLM from p...
- Long live the liver: immunohistochemical and stere...
- The versatility of the mitochondrial presequence p...
- Risk factors for active bleeding from colonic angi...
- Dry-powder inhalers in patients with persistent ai...
- Silica-Coated Nonstoichiometric Nano Zn-Ferrites f...
- Biomimicking Platelet–Monocyte Interactions as a N...
- From The Mine to Cancer Therapy: Natural and Biode...
- Hemoglobin-Conjugated Gelatin Microsphere as a Sma...
- A framework for guiding sustainability assessment ...
- Evaluating SoilGen2 as a tool for projecting soil ...
- PCR detection of Burkholderia multivorans in water...
- Quantification and characterization of glyphosate ...
- An alternative approach to the calculation and ana...
- Naturally Dried Graphene Aerogels with Superelasti...
- Spoofing voice verification systems with statistic...
-
▼
Σεπ 05
(50)
- ► Φεβρουαρίου (1350)
- ► Ιανουαρίου (1400)
-
►
2015
(1500)
- ► Δεκεμβρίου (1450)
Ετικέτες
Εγγραφή σε:
Σχόλια ανάρτησης (Atom)
Δεν υπάρχουν σχόλια:
Δημοσίευση σχολίου