Research – Jatan Loya

Research

← Back to Home

SAGE‑RT: Synthetic Alignment data Generation for Safety Evaluation and Red Teaming. COLM 2025 (to appear).
Efficacy of the SAGE‑RT Dataset for Model Safety Alignment: A Comparative Study. NeurIPS Pluralistic Alignment Workshop 2024.
VERA: Validation and Enhancement for Retrieval Augmented systems. arXiv 2024.
Robust recovery of adversarial examples. ICML AML 2024.
Measuring the effectiveness of Spinner‑based Randomized‑Response differential privacy communication for sensitive data sharing. SOUPS 2023 (Poster).
Robust adversarial training for detection of adversarial samples. ICICV 2022.
ViT‑inception‑GAN for image colourisation. ICICC 2021.
Privacy‑preserving keystroke analysis using fully homomorphic encryption & differential privacy. ICML TPDP 2021.