Open Access Theses & Dissertations

Phishibl: A Systematic Evaluation Of Instance-Based Learning Model For Predicting Phishing Susceptibility

Shova Kuikel, University of Texas at El Paso

Date of Award

2025-12-01

Degree Name

Master of Science

Department

Computer Science

Advisor(s)

Palvi P. Aggarwal

Abstract

Despite enormous efforts to develop defenses against phishing attacks, humans still struggle to detect phishing emails given the constantly evolving attacker strategies. This thesis aims to test the predictive capabilities of a cognitive model that represents the individual susceptibility to phishing emails. While training programs aim to raise awareness, most remain outdated and ineffective against evolving attack strategies. Recent advances in Machine Learning, Artificial Intelligence, and Large Language Models (LLMs) offer new defenses, yet understanding human decision processes remains crucial, as effective systems must emulate how people evaluate unfamiliar emails based on prior experience. This research introduces a cognitive model grounded in Instance-Based Learning (IBL) Theory, which captures how individuals compare new emails to past encounters. We evaluated five cognitive model variants against human email classification data. To examine the role of experience representation, we tested three approaches: IBL-Forecaster, which predicts decisions without prior experience to assess whether a model can learn from scratch; IBL-Human-Assisted, which begins with human experiences to evaluate how prior knowledge improves prediction; and IBL-Foregone, where model can learns not only from its actual experiences but also from the counterfactual outcomes. On the other hand, to develop more personalized model, we tested two additional variants: IBL-Tracing, which updates memory based on human decisions, and IBL-Tracing Calibration, which further tunes the model by optimizing the decay parameter after tracing. IBL-Tracing Calibration achieved the highest alignment (81\%), outperforming IBL-Tracing (74\%), IBL-Foregone (72\%), and all other tested models. Using IBL-Tracing and IBL-Tracing Calibration, we further test Latent Semantic Analysis (LSA) and embeddings from open-source LLMs: BERT, Llama-8B, Wizard, and Mistral. LLM embeddings reduce alignment compared to LSA, though applying Principal Component Analysis (PCA) yields modest improvements without surpassing LSA. These results suggest the need to refine how cognitive models represent human context and similarity judgments in phishing detection, and they reveal current limitations in open-source LLM embeddings for modeling human decision processes. Moreover, this work represents an early step toward adaptive, cognitively informed personalized training modules that learn with the user rather than merely assessing or correcting them.

Language

Provenance

Received from ProQuest

Copyright Date

2025-12

File Size

68 p.

File Format

application/pdf

Rights Holder

Shova Kuikel

Recommended Citation

Kuikel, Shova, "Phishibl: A Systematic Evaluation Of Instance-Based Learning Model For Predicting Phishing Susceptibility" (2025). Open Access Theses & Dissertations. 4563.
https://scholarworks.utep.edu/open_etd/4563

Download

Included in

Computer Sciences Commons

COinS

Open Access Theses & Dissertations

Phishibl: A Systematic Evaluation Of Instance-Based Learning Model For Predicting Phishing Susceptibility

Date of Award

Degree Name

Department

Advisor(s)

Abstract

Language

Provenance

Copyright Date

File Size

File Format

Rights Holder

Recommended Citation

Included in

Search

Links

Browse

Author Corner

Open Access Theses & Dissertations

Phishibl: A Systematic Evaluation Of Instance-Based Learning Model For Predicting Phishing Susceptibility

Author

Date of Award

Degree Name

Department

Advisor(s)

Abstract

Language

Provenance

Copyright Date

File Size

File Format

Rights Holder

Recommended Citation

Included in

Share

Search

Links

Browse

Author Corner