Projects - Salma Shaik

auto-kernel

Autonomous roofline-guided GPU kernel optimization. Profiles whether a kernel is compute- or memory-bound, proposes a hypothesis from the optimization guide, modifies the kernel, benchmarks against the reference, and iterates — committing, benchmarking, and reverting autonomously.

beacon

Context assembly for LLM code repair under strict token budgets. Parses Python repos into a knowledge graph (AST → calls, imports, inheritance), runs BM25/dense/BCA retrieval, assembles context that fits the budget, and measures whether better retrieval actually fixes more bugs.

tierra+

Calibration-first retrieval stack for exoplanet atmospheres + NEOs. Forward-model wrappers, opacity ensembles, contamination latents, and reproducible inference in one package.

gymforge

Reproducible, evaluable reinforcement learning environments for training agents on computer-use tasks. Structured framework for generating diverse task batches, running sandboxed episodes, and scoring agent performance.

compress

Activation-aware compression for transformer models. Profiles which attention heads and MLP neurons are most active on your data distribution, then applies more aggressive compression to dormant ones — better task-specific quality at the same compression ratio.

The Third Pillar

Multi-agent simulation measuring LLM strategic behavior in resource competition under asymmetric information and institutional constraints.

pufferX

Object-storage-first hybrid search engine w/ IVF vector search + BM25 full‑text with metadata filters, immediate write visibility via a hot index, and async merge to an S3‑backed cold index. Features adaptive nprobe, NVMe caching, and RRF fusion for robust relevance.

auto-kernel

beacon

tierra+

gymforge

compress

The Third Pillar

pufferX

inductra

shardspark

GPU-Accelerated-Denoiser

kairon