BH akademski imenik

Evaluating Machine and Deep Learning Models for Stroke Prediction with Class Imbalance Handling

9. 6. 2026.

0

Stela Lila, Zerina Altoka, Samed Jukic, Bekir Karlik, Jasmin Kevrić

Mediterranean Conference on Embedded Computing

Stroke prediction plays an important role in healthcare, as it allows for the potential implementation of early measures and intervention. The Kaggle stroke dataset was used and compared between deep learning (DL) and traditional machine learning (ML) models. Preprocessing steps include imputation to the missing BMI values, one hot and label encoding, and z-score normalization, all applied before a stratified 5-fold cross validation with confidence intervals for the split. SMOTE was then applied exclusively to the training set to address class imbalance (~95% non-stroke). Eight classifiers were compared against each other which included five ML models: Random Forest (RF), Logistic Regression (LR), K-nearest Neighbor (KNN), Support Vector Machine (SVM), Decision Tree (DT) and three DL architectures: Multi-Layer Perceptron (MLP), Convolutional Neural Network (CNN) and Artificial Neural Network (ANN). Due to the severe class imbalance in the dataset, F1-score was adopted as the primary evaluation metric, as it balances precision, recall and better reflects the model ability to correctly identify the minority stroke class. The most successful models were LR (30% F1-Score, 88% Accuracy) and MLP (28% F1-Score, 84% Accuracy). This suggests that for this dataset, classical ML models may offer competitive performance compared to deep learning on structured tabular data.

Preuzmi PDF

Vidi više

Mathematical Formalization of Vectorization and Ensemble Learning Algorithms for Talent Detection in Educational Systems

18. 3. 2026.

0

Ana Lojić, Samed Jukic

2026 25th International Symposium INFOTEH-JAHORINA (INFOTEH)

The development of reliable Decision Support Systems (DSS) for talent identification requires a rigorous analytical framework capable of processing high-dimensional educational data. This paper presents the mathematical formulation of the machine learning pipeline utilized for classifying student potential, focusing on the algebraic structure of data representation and the optimization of predictive algorithms. We formally define the mapping of unstructured textual attributes into sparse vector spaces using One-Hot Encoding and analyze the dimensionality reduction effects. The study details the training dynamics of classification models, specifically examining the cost function minimization in Decision Trees via the Gini Impurity index and the stochastic aggregation mechanisms within Random Forest ensembles. Furthermore, to address the challenge of class imbalance, we provide a formal definition of performance metrics, including the harmonic mean of precision and recall and the arithmetic mean of indicator functions for Global Top-K Accuracy. By establishing these mathematical foundations, the paper demonstrates how formal optimization directly correlates with the discriminative power and stability of AI-driven educational assessments.

Preuzmi PDF

Vidi više

Benchmarking Wav2Vec and Traditional Speech Recognition in Speech Transcription

26. 11. 2025.

0

Olsi Shehu, Damiana Teliti, Jasmin Kevrić, Samed Jukic, Bekir Karlik

2025 6th International Conference on Communications, Information, Electronic and Energy Systems (CIEES)

This study presents an empirical benchmarking comparison of two distinct speech-to-text approaches under identical conditions: the Speech Recognition module, which utilizes the online Google Web Speech API, and the offline Wav2Vec model developed by Facebook AI. Both approaches facilitate the transformation of spoken language into written language, although they demonstrate unique characteristics in terms of reliance on the internet, speed, and precision. This study utilizes the LJ Speech dataset, which contains short audio segments of a single reader supplemented by their corresponding transcriptions. Both examined models acquire text from the identical dataset and subsequently assess its similarity to the texts within the dataset. Our analysis reveals that wav2vec outperforms the speech recognition model in both accuracy and performance, suggesting the use of wav2vec in speech-to-speech implementations.

Preuzmi PDF

Vidi više

Integrating Big Data Analytics with the Aura Blockchain for Supply Chain Insights

25. 11. 2025.

0

Haris Hanjalic, Samed Jukic

Telecommunications Forum

This paper examines the integration of Big Data analytics with the Aura Blockchain platform to enhance supply chain management in the luxury goods industry. The Aura Blockchain Consortium, established in 2019-2021 by leading luxury brands including LVMH, Prada, and Cartier, provides a permissioned blockchain infrastructure for product traceability and authenticity verification. By 2025, the consortium has grown to 50+ member brands with over 50 million luxury products registered on its blockchain. This study explores how combining blockchain's immutable ledger capabilities with big data analytics techniques can yield valuable insights including end-to-end traceability, anti-counterfeiting measures, supply chain optimization, quality control, and enhanced customer engagement. The paper presents a technical architecture for integration, discusses real-world implementations such as digital product passports, and addresses challenges including data privacy, scalability, interoperability, and organizational adoption. The findings suggest that this technological convergence enables luxury brands to transition from reactive to proactive supply chain management, meeting both regulatory requirements like the EU's Digital Product Passport initiative and evolving consumer expectations for transparency and sustainability.

Preuzmi PDF

Vidi više

Testing Different Models for Brain Tissue Segmentation on ISBR18 Dataset

29. 10. 2025.

1

Stela Lila, Dino Arnaut, Samed Jukic, Bekir Karlik

2025 6th International Workshop on Engineering Technologies and Computer Science (EnT)

Segmentation of brain tissue is an essential task in medical image analysis, particularly in neuroimaging and disease diagnosis. This study evaluates and compares three major segmentation approaches in the ISBR18 dataset: atlas-based methods, machine learning techniques, and deep learning architectures. The atlas-based Majority Voting method achieved the highest performance within its category with a dice similarity coefficient of 0.8477, utilizing anatomical templates for segmentation. Among machine learning techniques, K-means clustering demonstrated robust performance with 96% classification accuracy, offering computational efficiency despite limitations in spatial resolution. The deep learning U-Net model trained for binary segmentation achieved 93% accuracy, benefiting from its encoder-decoder architecture for precise boundary detection. While traditional atlas-based approaches provide robust anatomical consistency and machine learning methods offer computational advantages, deep learning models show promise in handling complex segmentation tasks. Future research could integrate these approaches to enhance segmentation performance in the ISBR18 dataset and lead to more accurate and reliable brain tissue segmentation for clinical applications.

Preuzmi PDF

Vidi više

USING NATURAL LANGUAGE PROCESSING (NLP) FOR CATEGORIZING PAPER TITLES FROM GOOGLE FORMS

12. 6. 2024.

1

Ana Lojić, Zerina Mašetić, Samed Jukic

Contemporary Theory and Practice in Construction

<p>Modern data collection, storage, and processing rely on diverse techniques to handle various types of information, ranging from structured tables to free-form text. This paper explores the captivating application of Natural Language Processing (NLP) for categorizing titles from Google Forms or any other textual data. The process of training an NLP model will be demonstrated through a specific example. Just as we learn from our past experiences, NLP models need to be fed with relevant data and labels. This ensures accurate and efficient processing even when new titles are introduced. We will conclude with a fascinating demonstration of how NLP algorithms analyze the structure and meaning of titles. By identifying keywords and understanding the context, they can automatically classify titles into relevant categories. This dramatically simplifies data organization and analysis, empowering us to extract valuable insights faster.</p>

Preuzmi PDF

Vidi više

Predictive Analysis of Student Enrollment on a Faculty Base on Innovation Research

20. 3. 2024.

2

Ana Lojić, Jasmin Kevric, Samed Jukic

2024 23rd International Symposium INFOTEH-JAHORINA (INFOTEH)

In research aimed at determining the level of interest of high school students in enrolling in colleges, predictive analysis models and comparisons are rarely applied during the classification and processing of various data. All of this leads to significant fluctuations in college admissions, where certain schools are unable to admit a large number of students who show interest in a specific field. On the other hand, high school students lose interest in certain schools, leading to the discontinuation of specific directions essential for today's job market needs. Institutions largely fail to conduct a comparison and linkage of teaching and non-teaching activities when analyzing the talents and interests of high school students from different fields. The goal of this paper is to use programming language classifiers to predict student enrollments in colleges based on the results students demonstrate during regular attendance in high schools through participation in innovation fairs.

Preuzmi PDF

Vidi više

Brain Tumor Detection and Classification Using VGG16 Deep Learning Algorithm and Python Imaging Library

17. 11. 2023.

12

Sulejman Karamehić, Samed Jukic

Bioengineering Studies

Early diagnosis and treatment of brain cancer depend on the detection and categorization of brain tumors. Deep learning algorithms have produced amazing results in medical imaging applications including tumor identification. Most of this field's research has concentrated on applying CNN algorithms like VGG16, DNN, and ANN to this problem. This work describes the identification and classification of brain tumors using the Python Imaging Library (PIL) and the VGG16 deep learning algorithm. A dataset of 7000 MRI pictures categorized by tumor type served as the foundation for the research. The main objective of this study was to develop a high-efficiency, high-accuracy model. We suggested utilizing the VGG16 architecture and preprocessing images with PIL to ensure consistent images for training on a sizable dataset of brain magnetic resonance imaging (MRI) images. A novel technique we have used in our work is one that can analyze a single image and predict the presence of a tumor from the results. The research's methods produced robust tumor detection across the dataset with 96, 9% accuracy, indicating the value of the method in helping medical professionals make informed decisions when diagnosing the presence of tumors.

Preuzmi PDF

Vidi više

Predictive analysis of student enrolment in secondary schools

15. 3. 2023.

1

Ana Lojić, Samed Jukic

2023 22nd International Symposium INFOTEH-JAHORINA (INFOTEH)

In research to determine the degree of interest in enrolling students in certain high schools, predictive analysis and comparison models are rarely used when classifying and processing different data. All this leads to large fluctuations in enrolment in secondary schools, where certain schools are unable to enrol numerous students who show an interest in a particular field. On the other hand, students lose interest in certain schools, which leads to the discontinuation of certain courses necessary for the needs of today's labour market. Institutions responsible for organizing the educational process do not sufficiently compare and connect teaching and non-teaching activities when analysing the talents and interests of elementary school students from different fields. The goal of this work is to predict the enrolment of students in secondary schools, using the classifiers of programming languages, based on the results that students express during regular classes in elementary schools.The results show that the accuracy of the data during the training of the Random Forest predictor is 52%, while in Wolfram Alpha it is 62%

Preuzmi PDF

Vidi više

Application of Business Intelligence in Decision Making for Credit Card Approval

23. 2. 2023.

5

Admel Husejinovic, Nermina Durmic, Samed Jukic

Journal of Intelligence Studies in Business

This paper aims to show how business intelligence can be applied in the credit card approval process. More specifically, the paper investigates how information like an applicant’s age, credit score, debt, income, and prior default can be used in credit card approval prediction.The dataset used for analysis is a publicly available dataset from the UCI machine learning repository. Logistic regression is used to make a prediction model with a reasonable number of attributes for a comprehensible business model. The Chi-square test of independence is used to test the dependence of credit card approval results with attributes. Research uncovers that prior default is supposed to be the most important attribute in the approval process. Finally, the authors propose several visualizations that could help make smarter decisions with effective credit risk assessment.

Preuzmi PDF

Vidi više

Intelligent Tutoring System of Linear Programming

2022.

0

Amor Hasić, Samed Jukic

Advances in Linear Algebra &amp; Matrix Theory

There is a growing technological development in intelligent teaching systems. This field has become interesting to many researchers. In this paper, we present an intelligent tutoring system for teaching mathematics that helps students un-derstand the basics of linear programming using Linear Program Solver and Service for Solving Linear Programming Problems, through which students will be able to solve economic problems. It comes down to determining the minimum or maximum value of a linear function, which is called the objective function, according to pre-set limiting conditions expressed by linear equations and inequalities. The goal function and the limiting conditions represent a mathematical model of the observed problem. Working as a professor of mathematics in high school, I felt the need for one such work and dealing with the study of linear programming as an integral part of mathematics. There are a number of papers in this regard, but exclusively related to traditional ways of working, as stated in the introductory part of the paper. The center of work as well as the final part deals with the study of linear programming using programs that deal with this topic.

Preuzmi PDF

Vidi više

Letter Recognition Using Machine Learning Algorithms

2022.

0

Merima A. Ćeranić, Samed Jukic

Journal of Natural Sciences and Engineering

Optical character recognition represents the mechanical or electronic conversion of handwritten, typed or printed images into coded text. Optical character recognition is widely used as a form of data entry from records that have been printed, and it can include invoices, bank statements, passports and many more. In the research, Optical character recognition reads data from the Re-Captcha dataset of images, converts them into strings, and these strings are used for testing, training and calculating prediction accuracy. The methodologies used are Convolutional neural network and Recurrent neural network. The convolutional neural network consist of neurons that receive data and group them according to similarity. A recurrent neural network cycle can be created between the connections of nodes, allowing the output from nodes to influence the subsequent input to other nodes. For data were used Re-Captcha images, and for the prediction of characters from images was used TensorFlow with Keras. The best results that are produced can be compared between first and last result, where the loss for first result was 20.63 and value loss was 16.45, while last result has loss of 0.56 and value loss of 2.96

Preuzmi PDF

Vidi više

Deep Learning-Based Studies on Pediatric Brain Tumors Imaging: Narrative Review of Techniques and Challenges

28. 5. 2021.

25

Hala Shaari, Jasmin Kevric, Samed Jukic, L. Bešić, D. Jokić, N. Ahmed, Vladimir M. Rajs

Brain Science

Brain tumors diagnosis in children is a scientific concern due to rapid anatomical, metabolic, and functional changes arising in the brain and non-specific or conflicting imaging results. Pediatric brain tumors diagnosis is typically centralized in clinical practice on the basis of diagnostic clues such as, child age, tumor location and incidence, clinical history, and imaging (Magnetic resonance imaging MRI / computed tomography CT) findings. The implementation of deep learning has rapidly propagated in almost every field in recent years, particularly in the medical images’ evaluation. This review would only address critical deep learning issues specific to pediatric brain tumor imaging research in view of the vast spectrum of other applications of deep learning. The purpose of this review paper is to include a detailed summary by first providing a succinct guide to the types of pediatric brain tumors and pediatric brain tumor imaging techniques. Then, we will present the research carried out by summarizing the scientific contributions to the field of pediatric brain tumor imaging processing and analysis. Finally, to establish open research issues and guidance for potential study in this emerging area, the medical and technical limitations of the deep learning-based approach were included.

Preuzmi PDF

Vidi više

Feature selection using cloud-based parallel genetic algorithm for intrusion detection data classification

17. 3. 2021.

31

Dželila Mehanović, Dino Kečo, Jasmin Kevric, Samed Jukic, Adnan Miljković, Zerina Mašetić

Neural computing & applications (Print)

Preuzmi PDF

Vidi više

Analysis of High School Graduate Data Using Database Analytics Tools

2021.

0

Ezana Ceman, Ajdin Salihovic, Samed Jukic

Journal of Natural Sciences and Engineering

It can be confidently stated that access to education is one of the most prized possessions available to us today. Although there are underlying factors such as the discrepancies in the education being provided worldwide, it is imperative that data scientists and all those interested take advantage of the data publicly available to draw necessary insights into how to better the education sector in our respective countries. The purpose of this research is to showcase various analytical insights into the 2020 New York State (NYS) high school graduation rate data using various advanced database systems techniques, specifically using SQL. With these analyses, further studies and conclusions can be drawn for local governments to implement into their plans to increase the quality of the schooling system, to aim for equality for all without reg

Preuzmi PDF

Vidi više

Nema pronađenih rezultata, molimo da izmjenite uslove pretrage i pokušate ponovo!

Publikacije (35)

Filters

Filteri

Datum objave

Uključeni istraživači

Dodatni filteri

Pretplatite se na novosti o BH Akademskom Imeniku