TimeStack's AI is grounded in peer-reviewed behavioral science, computational neuroscience, and state-of-the-art machine learning research. We publish our methodological innovations and collaborate with academic institutions.
Our research agenda spans the full stack of behavioral intelligence — from foundational ML methods to applied behavioral science.
How do you train LLMs to reason about human behavior with the nuance of expert coaches? Our research explores domain-adaptive pre-training on behavioral corpora, temporal position encodings that enable models to understand time-dependent context, and RLHF alignment using expert coach evaluations as reward signals.
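The temporal position encoding idea can be sketched with plain sinusoidal features over elapsed time. This is an illustrative stand-in, not the encoding our production models use; the geometric spacing of periods (one hour up to one year) is an assumption chosen so circadian, weekly, and seasonal offsets map to distinct phases:

```python
import math

def temporal_encoding(delta_hours: float, dim: int = 8) -> list[float]:
    """Sinusoidal encoding of elapsed time between two events (illustrative).

    Periods grow geometrically from 1 hour to 8760 hours (~1 year), so each
    sin/cos pair captures phase at a different behavioral timescale.
    """
    encoding = []
    for i in range(dim // 2):
        # period_i = 8760^(i / (dim/2 - 1)): 1h, ~21h, ~424h, 8760h for dim=8
        period = 8760.0 ** (i / max(dim // 2 - 1, 1))
        angle = 2 * math.pi * delta_hours / period
        encoding.extend([math.sin(angle), math.cos(angle)])
    return encoding
```

Two events exactly one year apart land on the same phase in the yearly channel, which is what lets attention treat "same season last year" as nearby.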
Human behavior operates on multiple timescales simultaneously — circadian rhythms, weekly patterns, seasonal trends, and multi-year trajectories. We develop temporal fusion architectures that jointly model these scales, with custom attention mechanisms handling the characteristic irregularity of real-world behavioral data.
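As a minimal stand-in for learned temporal fusion, the same irregular event stream can be summarized at several window sizes at once; the window lengths and the mean aggregator below are illustrative assumptions, not the trained architecture:

```python
def multiscale_features(events, now, windows=(1, 7, 30)):
    """Summarize an irregular event stream at several timescales.

    `events` is a list of (t_days, value) pairs with arbitrary spacing;
    each window yields the mean of values observed within it (illustrative).
    """
    features = {}
    for w in windows:
        recent = [value for t, value in events if now - t <= w]
        features[f"mean_{w}d"] = sum(recent) / len(recent) if recent else 0.0
    return features
```

A learned model replaces the fixed means with attention over the raw events, but the multi-resolution view is the same.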
Our graph neural network research focuses on learning personalized causal structures from observational behavioral data. Unlike correlation-based approaches, we develop methods that distinguish causal direction (does stress cause poor sleep, or does poor sleep cause stress?) using Granger causality tests integrated with GNN message passing.
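The direction test can be illustrated with a lightweight two-stage regression: predict the target from its own lag, then check how much of the leftover variance the other signal's lag explains. This is a simplified proxy for a proper Granger F-test, all series here are synthetic, and the 0.8 coupling coefficient is an assumption:

```python
import random

def simple_ols(xs, ys):
    """Closed-form least-squares fit y = intercept + slope * x."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    var = sum((x - mx) ** 2 for x in xs)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = cov / var if var else 0.0
    return my - slope * mx, slope

def granger_gain(x, y):
    """Fraction of y's lag-1 residual variance explained by x's lag.

    Larger values suggest x helps predict y beyond y's own history;
    a lightweight proxy for a Granger causality test (illustrative).
    """
    y_lag, y_now, x_lag = y[:-1], y[1:], x[:-1]
    a, b = simple_ols(y_lag, y_now)
    resid = [yt - (a + b * yl) for yl, yt in zip(y_lag, y_now)]
    c, d = simple_ols(x_lag, resid)
    sse_restricted = sum(r * r for r in resid)
    sse_full = sum((r - (c + d * xl)) ** 2 for xl, r in zip(x_lag, resid))
    return 1.0 - sse_full / sse_restricted if sse_restricted else 0.0

# Synthetic world where poor sleep drives stress, not the reverse:
random.seed(0)
sleep_deficit = [random.uniform(0.0, 1.0) for _ in range(300)]
stress = [0.0] + [0.8 * sleep_deficit[t - 1] + 0.05 * random.uniform(-1.0, 1.0)
                  for t in range(1, 300)]
```

Comparing `granger_gain(sleep_deficit, stress)` against `granger_gain(stress, sleep_deficit)` recovers the planted causal direction; in the full system this signal is integrated into GNN message passing rather than used standalone.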
The intervention optimization problem — when, how, and how much to nudge a user — is fundamentally a sequential decision problem under uncertainty. We develop RL agents that balance short-term engagement with long-term behavioral change, explicitly modeling notification fatigue and the diminishing returns of repetitive interventions.
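A toy version of such a reward function might credit engagement with diminishing returns while charging a fatigue penalty that grows with repetition. The functional form and `fatigue_weight` below are assumptions for illustration, not the deployed reward:

```python
import math

def nudge_reward(engaged: bool, nudges_today: int,
                 fatigue_weight: float = 0.3) -> float:
    """Illustrative RL reward for one nudge: engagement credit shrinks
    with each prior nudge (diminishing returns), and a linear fatigue
    penalty accumulates regardless of outcome.
    """
    engagement = (1.0 if engaged else 0.0) / math.sqrt(1 + nudges_today)
    return engagement - fatigue_weight * nudges_today
```

Under this shape, even a successful fifth nudge of the day scores negative, so the agent learns to hold fire rather than spam.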
Behavioral data is deeply personal. Our federated learning research develops methods to train effective shared models without centralizing sensitive data. We combine federated averaging with differential privacy guarantees (epsilon-DP) and investigate secure aggregation protocols for multi-tenant enterprise deployments.
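The core recipe (clip each client's update, average, add calibrated noise to the aggregate) fits in a few lines. The clipping norm and noise scale below are placeholders, and the accounting that turns a given noise level into a formal (epsilon, delta)-DP guarantee is deliberately omitted:

```python
import random

def federated_average(client_updates, clip=1.0, noise_std=0.1, seed=0):
    """FedAvg with per-client L2 clipping and Gaussian noise on the
    aggregate -- the standard building block of DP federated averaging
    (sketch only; no privacy accounting).
    """
    rng = random.Random(seed)
    n, dim = len(client_updates), len(client_updates[0])
    clipped = []
    for update in client_updates:
        norm = sum(v * v for v in update) ** 0.5
        scale = min(1.0, clip / norm) if norm else 1.0  # bound each client's influence
        clipped.append([v * scale for v in update])
    average = [sum(u[i] for u in clipped) / n for i in range(dim)]
    return [a + rng.gauss(0.0, noise_std / n) for a in average]
```

Clipping bounds any single user's contribution; the noise then masks whatever remains, which is what makes the privacy guarantee per-user rather than per-record.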
Detecting behavioral anomalies (burnout, disengagement, mental health decline) requires models that understand individual baselines and can distinguish normal variation from concerning trends. Our VAE-based approach learns personalized behavioral distributions and calibrates detection thresholds per user to minimize false positives while catching genuine signals.
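In place of the VAE, a per-user z-score against the individual's own history illustrates the same calibration idea: thresholds are set relative to that user's baseline, not a population norm. The `k = 3` cutoff is an illustrative default:

```python
def is_anomalous(history, today, k=3.0):
    """Flag a value more than k standard deviations from this user's
    own baseline -- a z-score stand-in for VAE reconstruction-error
    anomaly detection (illustrative).
    """
    n = len(history)
    mean = sum(history) / n
    std = (sum((v - mean) ** 2 for v in history) / n) ** 0.5 or 1e-9
    return abs(today - mean) / std > k
```

A user who always sleeps 7.5 hours triggers on a 3-hour night; a naturally erratic sleeper with the same reading might not, which is exactly the per-user calibration the paragraph describes.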
Our AI doesn't operate in a scientific vacuum. Every model is designed around established behavioral science frameworks.
Our intervention optimizer is constrained to support autonomy, competence, and relatedness — the three basic psychological needs identified by self-determination theory. The RL reward function penalizes paternalistic interventions that undermine user autonomy.
The Chronos temporal model explicitly encodes cue-routine-reward cycles in its feature representation. Habit formation probability is predicted as a function of consistency, environmental cues, and reward immediacy — aligning with the Atomic Habits framework.
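A logistic model in those three factors is one simple way to express such a prediction. The weights and bias below are illustrative placeholders, not Chronos's fitted parameters:

```python
import math

def habit_probability(consistency, cue_strength, reward_immediacy,
                      weights=(3.0, 2.0, 1.5), bias=-3.0):
    """Logistic sketch of habit-formation probability from the three
    cue-routine-reward factors, each scaled to [0, 1] (illustrative
    weights, not fitted values).
    """
    factors = (consistency, cue_strength, reward_immediacy)
    z = bias + sum(w * f for w, f in zip(weights, factors))
    return 1.0 / (1.0 + math.exp(-z))
```

The ordering of the weights encodes the framework's claim that consistency dominates, followed by environmental cues, then reward immediacy.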
The Goal Decomposition LLM generates goals that are specific, measurable, achievable, relevant, and time-bound (SMART). It also maintains optimal difficulty levels — Locke's research shows that moderately difficult goals produce the highest performance.
The core mathematical model: 1.01^365 ≈ 37.78x. TimeStack's scoring system, goal hierarchies, and temporal prediction all model compound growth — small daily improvements cascading into transformative long-term outcomes.
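The arithmetic is easy to verify directly:

```python
def compound(daily_rate: float, days: int) -> float:
    """Compound growth factor after `days` of `daily_rate` improvement."""
    return (1.0 + daily_rate) ** days

# Getting 1% better every day for a year compounds to roughly a 37.78x gain.
year_of_one_percent = compound(0.01, 365)
```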
The DomainGraph model limits simultaneous domain recommendations to prevent cognitive overload. The intervention optimizer caps daily nudges based on the user's current cognitive load estimate, derived from their active goal count and recent task complexity.
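One hedged sketch of such a cap: derive a rough load score from goal count and task complexity, then scale the nudge budget down as load rises. The coefficients and the floor of one nudge are illustrative assumptions, not the deployed estimator:

```python
def daily_nudge_cap(active_goals: int, avg_task_complexity: float,
                    base_cap: int = 5) -> int:
    """Scale the daily nudge budget down as estimated cognitive load
    rises (illustrative coefficients; load capped at 0.8 so at least
    one nudge always remains possible).
    """
    load = 0.1 * active_goals + 0.2 * avg_task_complexity
    return max(1, round(base_cap * (1.0 - min(load, 0.8))))
```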
Positive emotions broaden awareness and build lasting resources (the core claim of Fredrickson's broaden-and-build theory). The Wellbeing Sentinel model tracks positive affect indicators, and the LLM coaching engine uses upward spirals — leveraging small wins to build momentum for larger behavioral changes.
Behavioral AI is a nascent field. These are the hard problems we're actively working to solve.
New users have no behavioral history. How do you provide meaningful predictions from day one? We're developing meta-learning approaches that transfer knowledge from similar user cohorts, bootstrapping personalization from minimal demographic and preference signals.
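A nearest-cohort prior is the simplest version of this transfer: start the new user from the learned prior of the most similar cohort. Real meta-learning replaces the lookup with a learned initialization, and the cohort profiles and prior values below are hypothetical:

```python
def cold_start_estimate(user_profile, cohort_priors):
    """Bootstrap a prediction for a history-less user from the nearest
    cohort's prior (hypothetical cohorts; a stand-in for meta-learned
    initialization).

    `user_profile` and cohort keys are numeric feature tuples, e.g.
    (age, prefers_morning); values are cohort prior estimates.
    """
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    nearest = min(cohort_priors, key=lambda profile: sq_dist(profile, user_profile))
    return cohort_priors[nearest]
```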
Observational behavioral data contains both causal and correlational relationships. Distinguishing them matters: "exercise improves mood" (causal, actionable) vs. "happy people exercise more" (correlational, not actionable the same way). We're integrating causal inference methods with our GNN architecture.
Did today's 30-minute workout contribute to the career promotion 6 months later? The causal chain is long and noisy. Our temporal models struggle with credit assignment beyond 90-day horizons, which is a fundamental challenge for modeling compound growth.
Behavioral patterns vary significantly across cultures. "Work-life balance" means different things in different societies. We're researching culture-adaptive model variants that respect diverse behavioral norms while maintaining universal psychological foundations.
When should an AI system intervene in someone's life? Our research explores formal frameworks for intervention ethics — distinguishing between helpful nudges and manipulative persuasion, with constitutional AI constraints applied to our coaching LLM.
Users can game gamification systems. Our models need to distinguish genuine behavioral improvement from reward-hacking (e.g., completing trivial tasks for points). We're developing adversarial robustness for our scoring and prediction models.
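A first-line heuristic might simply flag days where most completed tasks are trivially short; the duration threshold and share cutoff are illustrative assumptions, not our deployed detector, which would sit upstream of the adversarially trained models:

```python
def looks_like_gaming(task_durations_min, trivial_minutes=2.0,
                      max_trivial_share=0.6):
    """Flag a day as likely reward-hacking when the share of trivially
    short completed tasks exceeds a cutoff (illustrative thresholds).
    """
    if not task_durations_min:
        return False
    trivial = sum(1 for m in task_durations_min if m <= trivial_minutes)
    return trivial / len(task_durations_min) > max_trivial_share
```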
TimeStack's research program ensures that every AI model we deploy is grounded in science, rigorously evaluated, and continuously improved. We're building the foundational AI for human behavioral intelligence.