BH akademski imenik

Ventricular Arrhythmia Classification Using Similarity Maps and Hierarchical Multi-Stream Deep Learning

1. 11. 2024.

1

Qing Lin, Dino Oglic, Michael J. Curtis, Hak-Keung Lam, Z. Cvetković

IEEE Transactions on Biomedical Engineering

Objective: Ventricular arrhythmias are the primary arrhythmias that cause sudden cardiac death. We address the problem of classification between ventricular tachycardia (VT), ventricular fibrillation (VF) and non-ventricular rhythms (NVR). Methods: To address the challenging problem of the discrimination between VT and VF, we develop similarity maps – a novel set of features designed to capture regularity within an ECG trace. These similarity maps are combined with features extracted through learnable Parzen band-pass filters and derivative features to discriminate between VT, VF, and NVR. To combine the benefits of these different features, we propose a hierarchical multi-stream ResNet34 architecture. Results: Our empirical results demonstrate that the similarity maps significantly improve the accuracy of distinguishing between VT and VF. Overall, the proposed approach achieves an average class sensitivity of 89.68%, and individual class sensitivities of 81.46% for VT, 89.29% for VF, and 98.28% for NVR. Conclusion: The proposed method achieves a high accuracy of ventricular arrhythmia detection and classification. Significance: Correct detection and classification of ventricular fibrillation and ventricular tachycardia are essential for effective intervention and for the development of new therapies and translational medicine.

Preuzmi PDF

Vidi više

A Hybrid GCN-LSTM Model for Ventricular Arrhythmia Classification Based on ECG Pattern Similarity

1. 7. 2024.

3

Qing Lin, Dino Oglic, Hak-Keung Lam, Michael J. Curtis, Z. Cvetković

Annual International Conference of the IEEE Engineering in Medicine and Biology Society

Accurate differentiation between Ventricular Tachycardia (VT) and Ventricular Fibrillation (VF) is essential in the field of cardiology. Recent advancements in deep learning have facilitated automated arrhythmia recognition, surpassing traditional electrocardiogram (ECG) methods that depend on manual feature extraction. Building on our previous work, which emphasized the importance of identifying patterns of regularity, we have developed a model that merges Graph Convolutional Networks (GCN) with Long Short-Term Memory (LSTM) networks. This GCN-LSTM model employs a trainable weighted ϵ-neighborhood graph to capture the similarity among time series within ECG segments. This approach has demonstrated substantial improvement in the classification of VT, VF, and non-ventricular rhythms.

Preuzmi PDF

Vidi više

Graph Neural Networks with Adaptive Readouts

9. 11. 2022.

68

David Buterez, J. Janet, S. Kiddle, Dino Oglic, Pietro Liò

Neural Information Processing Systems

An effective aggregation of node features into a graph-level representation via readout functions is an essential step in numerous learning tasks involving graph neural networks. Typically, readouts are simple and non-adaptive functions designed such that the resulting hypothesis space is permutation invariant. Prior work on deep sets indicates that such readouts might require complex node embeddings that can be difficult to learn via standard neighborhood aggregation schemes. Motivated by this, we investigate the potential of adaptive readouts given by neural networks that do not necessarily give rise to permutation invariant hypothesis spaces. We argue that in some problems such as binding affinity prediction where molecules are typically presented in a canonical form it might be possible to relax the constraints on permutation invariance of the hypothesis space and learn a more effective model of the affinity by employing an adaptive readout function. Our empirical results demonstrate the effectiveness of neural readouts on more than 40 datasets spanning different domains and graph characteristics. Moreover, we observe a consistent improvement over standard readouts (i.e., sum, max, and mean) relative to the number of neighborhood aggregation iterations and different convolutional operators.

Preuzmi PDF

Vidi više

Towards Robust Waveform-Based Acoustic Models

16. 10. 2021.

2

Dino Oglic, Z. Cvetković, Peter Sollich, S. Renals, Bin Yu

IEEE/ACM Transactions on Audio Speech and Language Processing

We study the problem of learning robust acoustic models in adverse environments, characterized by a significant mismatch between training and test conditions. This problem is of paramount importance for the deployment of speech recognition systems that need to perform well in unseen environments. First, we characterize data augmentation theoretically as an instance of vicinal risk minimization, which aims at improving risk estimates during training by replacing the delta functions that define the empirical density over the input space with an approximation of the marginal population density in the vicinity of the training samples. More specifically, we assume that local neighborhoods centered at training samples can be approximated using a mixture of Gaussians, and demonstrate theoretically that this can incorporate robust inductive bias into the learning process. We then specify the individual mixture components implicitly via data augmentation schemes, designed to address common sources of spurious correlations in acoustic models. To avoid potential confounding effects on robustness due to information loss, which has been associated with standard feature extraction techniques (e.g., fbank and mfcc features), we focus on the waveform-based setting. Our empirical results show that the approach can generalize to unseen noise conditions, with 150% relative improvement in out-of-distribution generalization compared to training using the standard risk minimization principle. Moreover, the results demonstrate competitive performance relative to models learned using a training sample designed to match the acoustic conditions characteristic of test utterances.

Preuzmi PDF

Vidi više

Deep Scattering Power Spectrum Features for Robust Speech Recognition

25. 10. 2020.

7

N. M. Joy, Dino Oglic, Z. Cvetković, P. Bell, S. Renals

Interspeech

Deep scattering spectrum consists of a cascade of wavelet transforms and modulus non-linearity. It generates features of different orders, with the ﬁrst order coefﬁcients approximately equal to the Mel-frequency cepstrum, and higher order coefﬁcients recovering information lost at lower levels. We investigate the effect of including the information recovered by higher order coefﬁcients on the robustness of speech recognition. To that end, we also propose a modiﬁcation to the original scattering transform tailored for noisy speech. In particular, instead of the modulus non-linearity we opt to work with power coefﬁcients and, therefore, use the squared modulus non-linearity. We quantify the robustness of scattering features using the word error rates of acoustic models trained on clean speech and evaluated using sets of utterances corrupted with different noise types. Our empirical results show that the second order scattering power spectrum coefﬁcients capture invariants relevant for noise robustness and that this additional information improves generalization to unseen noise conditions (almost 20% relative error reduction on AURORA 4). This ﬁnding can have important consequences on speech recognition systems that typically discard the second order information and keep only the ﬁrst order features (known for emulating MFCC and FBANK values) when representing speech.

Preuzmi PDF

Vidi više

A Deep 2D Convolutional Network for Waveform-Based Speech Recognition

25. 10. 2020.

12

Dino Oglic, Z. Cvetković, P. Bell, S. Renals

Interspeech

Due to limited computational resources, acoustic models of early automatic speech recognition ( ASR ) systems were built in low-dimensional feature spaces that incur considerable information loss at the outset of the process. Several comparative studies of automatic and human speech recognition suggest that this information loss can adversely affect the robustness of ASR systems. To mitigate that and allow for learning of robust models, we propose a deep 2 D convolutional network in the waveform domain. The ﬁrst layer of the network decomposes waveforms into frequency sub-bands, thereby representing them in a structured high-dimensional space. This is achieved by means of a parametric convolutional block deﬁned via cosine modulations of compactly supported windows. The next layer embeds the wave-form in an even higher-dimensional space of high-resolution spectro-temporal patterns, implemented via a 2 D convolutional block. This is followed by a gradual compression phase that selects most relevant spectro-temporal patterns using wide-pass 2 D ﬁltering. Our results show that the approach signiﬁcantly out-performs alternative waveform-based models on both noisy and spontaneous conversational speech ( 24% and 11% relative error reduction, respectively). Moreover, this study provides empirical evidence that learning directly from the waveform domain could be more effective than learning using hand-crafted features.

Preuzmi PDF

Vidi više

Edinburgh Research Explorer A Deep 2D Convolutional Network for Waveform-Based Speech Recognition

2020.

0

Dino Oglic, Z. Cvetković, P. Bell, S. Renals

Due to limited computational resources, acoustic models of early automatic speech recognition ( ASR ) systems were built in low-dimensional feature spaces that incur considerable information loss at the outset of the process. Several comparative studies of automatic and human speech recognition suggest that this information loss can adversely affect the robustness of ASR systems. To mitigate that and allow for learning of robust models, we propose a deep 2 D convolutional network in the waveform domain. The ﬁrst layer of the network decomposes waveforms into frequency sub-bands, thereby representing them in a structured high-dimensional space. This is achieved by means of a parametric convolutional block deﬁned via cosine modulations of compactly supported windows. The next layer embeds the waveform in an even higher-dimensional space of high-resolution spectro-temporal patterns, implemented via a 2 D convolutional block. This is followed by a gradual compression phase that selects most relevant spectro-temporal patterns using wide-pass 2 D ﬁltering. Our results show that the approach signiﬁcantly outperforms alternative waveform-based models on both noisy and spontaneous conversational speech ( 24% and 11% relative error reduction, respectively). Moreover, this study provides empirical evidence that learning directly from the waveform domain could be more effective than learning using hand-crafted features. by means of a non-parametric 2 D convolutional layer. is followed

Vidi više

Bayesian Parznets for Robust Speech Recognition in the Waveform Domain

23. 6. 2019.

0

Dino Oglic, Z. Cvetković, Peter Sollich

Preuzmi PDF

Vidi više

Parzen Filters for Spectral Decomposition of Signals

23. 6. 2019.

0

Dino Oglic, Z. Cvetković, Peter Sollich

arXiv.org

Preuzmi PDF

Vidi više

Learning Waveform-Based Acoustic Models Using Deep Variational Convolutional Neural Networks

23. 6. 2019.

8

Dino Oglic, Z. Cvetković, Peter Sollich

IEEE/ACM Transactions on Audio Speech and Language Processing

We investigate the potential of stochastic neural networks for learning effective waveform-based acoustic models. The waveform-based setting, inherent to fully end-to-end speech recognition systems, is motivated by several comparative studies of automatic and human speech recognition that associate standard non-adaptive feature extraction techniques with information loss, which can adversely affect robustness. Stochastic neural networks, on the other hand, are a class of models capable of incorporating rich regularization mechanisms into the learning process. We consider a deep convolutional neural network that first decomposes speech into frequency sub-bands via an adaptive parametric convolutional block where filters are specified by cosine modulations of compactly supported windows. The network then employs standard non-parametric 1D convolutions to extract relevant spectro-temporal patterns while gradually compressing the structured high dimensional representation generated by the parametric block. We rely on a probabilistic parametrization of the proposed neural architecture and learn the model using stochastic variational inference. This requires evaluation of an analytically intractable integral defining the Kullback–Leibler divergence term responsible for regularization, for which we propose an effective approximation based on the Gauss–Hermite quadrature. Our empirical results demonstrate a superior performance of the proposed approach over comparable waveform-based baselines and indicate that it could lead to robustness. Moreover, the approach outperforms a recently proposed deep convolutional neural network for learning of robust acoustic models with standard FBANK features.

Preuzmi PDF

Vidi više

Scalable Learning in Reproducing Kernel Krein Spaces

6. 9. 2018.

26

Dino Oglic, Thomas Gärtner

International Conference on Machine Learning

Preuzmi PDF

Vidi više

Large Scale Learning with Kreĭn Kernels

6. 9. 2018.

0

Dino Oglic, Thomas Gärtner

arXiv.org

We extend the Nystr\"om method for low-rank approximation of positive definite Mercer kernels to approximation of indefinite kernel matrices. Our result is the first derivation of the approach that does not require the positive definiteness of the kernel function. Building on this result, we then devise highly scalable methods for learning in reproducing kernel Kre\u{\i}n spaces. The main motivation for our work comes from problems with structured representations (e.g., graphs, strings, time-series), where it is relatively easy to devise a pairwise (dis)similarity function based on intuition/knowledge of a domain expert. Such pairwise functions are typically not positive definite and it is often well beyond the expertise of practitioners to verify this condition. The proposed large scale approaches for learning in reproducing kernel Kre\u{\i}n spaces provide principled and theoretically well-founded means to tackle this class of problems. The effectiveness of the approaches is evaluated empirically using kernels defined on structured and vectorial data representations.

Preuzmi PDF

Vidi više

Learning in Reproducing Kernel Kreı̆n Spaces

10. 7. 2018.

34

Dino Oglic, Thomas Gärtner

International Conference on Machine Learning

Preuzmi PDF

Vidi više

A Unified Analysis of Random Fourier Features

24. 6. 2018.

5

Zhu Li, Jean-François Ton, Dino Oglic, D. Sejdinovic

arXiv.org

Random Fourier features is a widely used, simple, and effective technique for scaling up kernel methods. The existing theoretical analysis of the approach, however, remains focused on specific learning tasks and typically gives pessimistic bounds which are at odds with the empirical results. We tackle these problems and provide the first unified risk analysis of learning with random Fourier features using the squared error and Lipschitz continuous loss functions. In our bounds, the trade-off between the computational cost and the expected risk convergence rate is problem specific and expressed in terms of the regularization parameter and the \emph{number of effective degrees of freedom}. We study both the standard random Fourier features method for which we improve the existing bounds on the number of features required to guarantee the corresponding minimax risk convergence rate of kernel ridge regression, as well as a data-dependent modification which samples features proportional to \emph{ridge leverage scores} and further reduces the required number of features. As ridge leverage scores are expensive to compute, we devise a simple approximation scheme which provably reduces the computational cost without loss of statistical efficiency.

Preuzmi PDF

Vidi više

Towards a Unified Analysis of Random Fourier Features

24. 6. 2018.

175

Zhu Li, Jean-François Ton, Dino Oglic, D. Sejdinovic

International Conference on Machine Learning

Preuzmi PDF

Vidi više

Nema pronađenih rezultata, molimo da izmjenite uslove pretrage i pokušate ponovo!

Publikacije (22)

Filters

Filteri

Datum objave

Uključeni istraživači

Dodatni filteri

Pretplatite se na novosti o BH Akademskom Imeniku