Joergen H. Jore
Visualization
Interactive demo in progress
All projects

Clusters or Chaos

Geometric structure of adversarial prompts in LLMs

PythonPyTorchReinforcement LearningLLMsAdversarial ML

[PLACEHOLDER] Master's thesis investigating whether adversarial prompts in deterministic LLMs form geometric clusters in embedding space or exhibit chaotic structure. Uses reinforcement learning and perturbation testing to map the adversarial landscape.

Method
RL + Perturbation Testing
Focus
Adversarial Prompt Geometry
Institution
NTNU