Research
SAGE‑RT: Synthetic Alignment data Generation for Safety Evaluation and Red Teaming. COLM 2025 (to appear).
Efficacy of the SAGE‑RT Dataset for Model Safety Alignment: A Comparative Study. NeurIPS Pluralistic Alignment Workshop 2024.
VERA: Validation and Enhancement for Retrieval Augmented systems. arXiv 2024.
Robust recovery of adversarial examples. ICML AML 2024.
Measuring the effectiveness of Spinner‑based Randomized‑Response differential privacy communication for sensitive data sharing. SOUPS 2023 (Poster).
Robust adversarial training for detection of adversarial samples. ICICV 2022.
ViT‑inception‑GAN for image colourisation. ICICC 2021.
Privacy‑preserving keystroke analysis using fully homomorphic encryption & differential privacy. ICML TPDP 2021.