Hi, my name is

Chunxiao(Elin) Ren.

Ship software at the intersection of MLE and full-stack engineering.

Master of Computing (CS specialization) @ NUS. I ship end-to-end — from data and modeling to deployed UIs on the cloud — across LLM agents, applied ML pipelines, and full-stack web. Open to SWE / MLE / DS internships in Singapore.

01. About Me

  • Climate Policy Evidence Knowledge Graph Platform (Neo4j + GraphRAG) · Cyber Risk Assessment Platform · multi-agent cyber-risk dissertation (NUS) · vertical code-gen agent at Seeyon · medical Q&A pipeline over a Neo4j disease KG.
Chunxiao (Elin) Ren

02. Research

Climate Change Wiki home — Topic → Policy Instrument → Outcome → Evidence Papers entry cards with the four Drivers root categories visible (Policy and Regulation, Physical Climate Shock, Technology and Market Shifts, Other Drivers) and per-card paper counts.

Research

Climate Policy Evidence Knowledge Graph Platform

[in progress] · 2026-01 — present

Chunxiao Ren

Research Assistant @ National University of Singapore

A research platform that turns climate-policy PDF literature into a queryable, traceable evidence knowledge graph in bulk — letting an LLM Agent answer researcher questions under a whitelist-validated citation gate, active refusal when evidence is thin, and a four-layer anti-hallucination stack. The backend parses PDFs with MinerU, extracts Finding / Evidence / Driver / Outcome nodes with GPT-4 into a Neo4j Aura cloud graph; Query Router v2 single-pass classifies each question into T1–T4, where T3 metadata queries run through 26 deterministic Cypher templates at zero LLM cost, and T1/T2 fan out across five parallel retrieval routes (semantic / hybrid / graph expansion / community summaries / Cypher precise fallback) before a constrained Agent assembles a structured Section answer; confidence is scored by an independent programmatic dual-track system and offline-calibrated via user-feedback-driven OLS, not self-reported by the LLM. The React + Vite frontend renders answers section-by-section over SSE events, with a built-in force-directed graph visualization and interactive neighbor expansion; deployed on a cloud server and exposed over HTTPS through Caddy + ngrok. Architecture, anti-hallucination stack, and screenshots in the deep dive.

  • Python
  • Neo4j
  • MinerU (PDF parsing)
  • Vector + GraphRAG retrieval
  • GPT-5.4-mini (keyword extraction)
  • OLS calibration
  • React + Vite
  • react-force-graph-2d
  • SQLite (feedback)

Research

Domain-Specific Agents: A Cyber Risk Multi-Agent Framework

[in progress] · 2026-01 — present

Chunxiao Ren

MSc Dissertation @ NUS School of Computing

A domain-structured, evidence-grounded multi-agent framework for cyber-risk analysis (MSc dissertation prototype). The system decomposes the reasoning task into role-specialised agents — Exposure / Likelihood / Impact / Coordinator / Critic — each with its own typed Pydantic schema and prompt, and supports two execution modes: an LLM-only pipeline, and a JELAS-grounded neuro-symbolic pipeline that injects pre-computed knowledge-graph + Datalog risk facts before any LLM call. The framework targets five testable claims: (C1) cyber risk is better modelled as structured reasoning than a single opaque prediction; (C2) domain-aligned roles yield more interpretable intermediate state than generic planner / reviewer roles; (C3) evidence-grounded reasoning improves coherence; (C4) lightweight conditional validation outperforms unconstrained multi-agent debate; (C5) cross-case "analyst experience" can be reused without retraining via a Jaccard × EWMA-recency CaseMemory adapted from LLMTraveler. Block-structured prompts make every component cheaply ablatable, so each claim has a matching A/B experiment.

  • Python
  • Pydantic v2 (typed schemas)
  • LLM orchestration
  • JELAS neuro-symbolic engine
  • Datalog
  • CaseMemory (Jaccard × EWMA-recency)
  • Block-ablatable prompts
CyberAssessment landing page — 'Let's start your Cyber Assessment' headline, a three-step Your Workflow panel (1. Company Profile Collection · 2. Historical Incident Cases · 3. Scenario Modeling & Loss Estimation), an Estimated Duration ~10m From URL to risk report tile, and a Get Started CTA.

Research

Cyber Risk Assessment Platform

[completed] · 2025-09 — 2026-02

Chunxiao Ren

Research Assistant — Lead Developer @ NUS School of Computing

A web platform that delivers cyber-risk assessments to insurance underwriters and SME operators through a guided intake → analysis → report workflow. As the main in-team developer on the product side, I owned the end-to-end web stack — frontend, backend, database, authentication and role-based access, admin tooling, feedback collection, and cloud deployment — wrapping the team's underlying risk engine into a product an underwriter can complete in roughly ten minutes from a single company URL. Internal, pre-commercial project.

  • Python
  • Flask
  • Tailwind CSS
  • Authlib (Google OAuth)
  • bcrypt
  • SQLite
  • OpenAI API (SSE)
  • SentenceTransformers
  • Jina Reader API
  • PyKEEN / NetworkX
  • gunicorn
  • ngrok
  • Vagrant

Research

Medical Q&A System with LLMs (RAG)

[completed] · 2025-01 — 2025-05

Chunxiao Ren

Research Assistant @ Lappeenranta University of Technology(LUT)

A medical Q&A pipeline that replaces vector-store RAG with structured Cypher retrieval over a Neo4j knowledge graph — to address LLMs' hallucination problem in safety-critical domains. Built on DiseaseKG (~44.6k entities, ~312k edges); NER fine-tunes chinese-roberta-wwm-ext + BiLSTM, intent recognition runs as few-shot prompting on a 34B LLM, and answers synthesise from retrieved triples via Qwen / Llama (UI-switchable). Streamlit frontend with user / admin login. Knowledge-graph schema, augmentation strategies, and the full retrieval flow are in the deep dive.

  • Python
  • Neo4j 5.18
  • chinese-roberta-wwm-ext
  • BiLSTM (2-layer) + Linear classifier
  • BIO tagging
  • TF-IDF entity alignment
  • 34B LLM (intent, few-shot + CoT)
  • Qwen / Llama
  • Streamlit

03. Where I’ve Worked

AI Large Model Engineer Intern

@ Beijing Seeyon Internet Software

2025-05 — 2025-08 · Beijing, China · CoMi Agent / V5 PaaS

  • Benchmarked Qwen, GLM, Llama and DeepSeek for enterprise workflow code generation via structured evaluation and prompt engineering, providing the basis for selecting and enhancing Seeyon's in-house model.
  • Fine-tuned the proprietary LLM 'CoMi' to generate Python business-logic scripts for the V5 PaaS platform, enabling automated OA workflow / template creation with 90%+ executable accuracy and a 20% reduction in manual configuration workload.
  • Built a high-quality fine-tuning dataset from real workflow documents and introduced Semantic Consistency Loss + AST Loss, improving both syntactic correctness and business-logic reliability of generated scripts.
  • Python
  • PyTorch
  • LLM Fine-tuning
  • SFT/LoRA
  • Qwen
  • GLM
  • DeepSeek
  • Prompt Engineering

04. Things I’ve Built

Featured Project

Singapore Public Housing Automated Valuation Model

2025-08 — 2025-12 · Collaborative Development

End-to-end ML pipeline for HDB resale price prediction on a Kaggle dataset (162,691 train / 50,000 test transactions, 2017–2025), augmented with five categories of geospatial POIs (~774 points: MRT, primary schools, secondary schools, malls, hawker centres) pulled from Singapore government open APIs. Engineered ~20 proximity features per sample using sklearn BallTree + Haversine, with dual-radius density counts and tier flags (top primary schools, MRT core lines, flagship malls). Final model: CatBoost + LightGBM + XGBoost stacking with 5-fold OOF and a no-intercept linear meta-learner; monotonic constraints on floor area and remaining lease; 3-seed averaging; a two-stage refiner for the top-10% high-price tail. Validation log-RMSE dropped from 0.061 (v2) to 0.050 (v3) — about 18% improvement.

  • BallTree + Haversine over ~774 POIs: ~20 proximity features per sample (dual-radius density, nearest distance, KNN-3, tier flags).
  • Stacking ensemble — CatBoost + LightGBM + XGBoost with 5-fold OOF and a no-intercept linear meta-learner.
  • Two-stage refinement for the top-10% high-price tail; 3-seed averaging (42 / 100 / 2025).
  • Validation log-RMSE 0.061 → 0.050 (≈ 18% improvement) across the v2 → v3 evolution.
  • Python
  • CatBoost
  • LightGBM
  • XGBoost
  • scikit-learn (BallTree, Haversine)
  • Stacking + linear meta
  • pandas
  • NumPy

Singapore Public Housing Automated Valuation Model

Featured Project

Multi-Strategy Movie Recommendation System

2025-01 — 2025-06 · Collaborative Development

A systematic study covering five method families — demographic baseline, content-based recall (TF-IDF + CountVectorizer), KNN collaborative filtering (item / user), SVD matrix factorization with three optimizers (SGD / SGLD / SGHMC), and three hybrid pipelines — evaluated on MovieLens ml-1m. Best single-stage recall: User-CF at 14.54% hit rate; best rating-prediction model: SVD-SGHMC at 0.84117 RMSE.

  • Nine recommenders end-to-end: demographic, 2× content-based, 2× KNN-CF, 3× SVD optimizers, 3× hybrids.
  • User-CF reached 14.54% hit rate (878 / 6040, 15% test split) — strongest single-stage recall.
  • SVD with SGHMC sampler beat SGD and SGLD: 5-fold CV RMSE 0.84117 on MovieLens ml-1m.
  • Recall-then-rerank hybrids trade raw hit rate for rating-aware ordering.
  • Python
  • scikit-learn
  • TF-IDF / CountVectorizer
  • KNN (item / user)
  • SVD
  • SGD / SGLD / SGHMC
  • MovieLens ml-1m
  • TMDB 5000
  • pandas
  • NumPy

Multi-Strategy Movie Recommendation System

Featured Project

TeamClaw — Local-First Multi-Agent Workspace

2026-01 — 2026-03 · Open-source contributor

Contributed to TeamClaw — a local-first multi-agent workspace that exposes an OpenAI-compatible /v1/chat/completions endpoint and ships a visual orchestration layer (OASIS) supporting sequential, parallel, selector, and DAG workflows. Unifies three agent types under a single Team abstraction: Stateless experts, Stateful sessions, and External-API agents (incl. OpenClaw). Team Creator turns a plain-text task description or discovered SOP pages into roles, personas, and a runnable DAG. Backed by a living GraphRAG memory (SQLite + optional Zep mirror), multimodal I/O, Telegram / QQ bot bridges, and Cloudflare Tunnel for one-click public access.

  • OpenAI-compatible local endpoint at /v1/chat/completions — drop-in for any OpenAI client.
  • OASIS engine: sequential / parallel / selector / DAG workflows over unified Stateless / Stateful / External-API agents.
  • Team Creator turns a task description or SOP page into roles, personas, and a runnable DAG.
  • Living GraphRAG memory (SQLite + optional Zep mirror) + multimodal I/O + Telegram / QQ bots + Cloudflare Tunnel public access.
  • Python
  • FastAPI / Flask
  • LangGraph
  • OASIS engine
  • MCP toolchain
  • OpenAI-compatible API
  • GraphRAG (Zep)
  • SQLite
  • Cloudflare Tunnel

TeamClaw — Local-First Multi-Agent Workspace