Projects

RLHF-Blender project screenshot

[1] RLHF-Blender

An interactive framework for learning reward models from diverse human feedback, combining pairwise preferences with richer feedback types in one configurable pipeline.

  • Built to support research on multi-type human feedback for RLHF.
  • Includes an interactive UI for feedback collection and reward model experimentation.

[Repo] [Website] [Demo]

smartgpt-ui project screenshot

[2] smartgpt-ui

A lightweight web interface for SmartGPT-style prompting workflows to improve reasoning quality with staged prompting and response refinement.

  • Inspired by AI Explained's SmartGPT workflow and built on top of Chatbot-UI.
  • Provides a practical interface to experiment with multi-step reasoning prompts.

[Repo] [Live Demo]

Tackling the ARC Challenge with Object-Centric World Models project screenshot

[3] Tackling the ARC Challenge with Object-Centric World Models

A research prototype exploring object-centric world models and reinforcement learning for François Chollet's ARC challenge.

  • Focuses on structured representations for better compositional generalization.
Project preview image coming soon

[4] Simple RAG for Confluence

A retrieval-augmented generation workflow to ingest heterogeneous files and produce structured summaries for Confluence pages.

  • Designed for practical team knowledge capture and fast internal search.