ADR-028: Dynamic Role-Based Access Control (RBAC) System

Overview

Think about permissions on your computer. You might have "admin" access, which sounds powerful - but what does that actually mean? Can you delete system files? Install software? Read other users' private files? The answer depends on what operations the system supports. As software evolves and adds new features, hardcoding permission rules becomes a nightmare.

This is the challenge we face as the knowledge graph system grows. We started with four simple roles (read-only, contributor, curator, admin) and hardcoded permissions like "curators can approve vocabulary." That works fine initially, but what happens when we add AI-generated ontologies? Collaboration workspaces? Tool execution? Do we keep adding if-statements for every new feature? That approach doesn't scale and creates a tangled mess of permission logic.

We need a system where permissions adapt to new resource types without changing code. When someone builds a "collaboration graph" feature, they should be able to register it as a new resource type and define what actions are possible (read, write, invite, moderate). Administrators should be able to create custom roles ("collaboration moderator") and grant fine-grained permissions ("can moderate collaboration graphs in the engineering workspace"). All without touching the authentication code.

This ADR describes dynamic RBAC (Role-Based Access Control) - a three-tier system where resources register themselves, roles can be created on the fly, and permissions can be scoped to specific instances. It's the difference between hardcoded permission rules and a flexible permission engine that grows with the platform.

Context

The current authentication system (ADR-027) has hardcoded roles (read_only, contributor, curator, admin) with static permissions seeded in kg_auth.role_permissions. As the platform evolves to support:

AI-generated ontologies
Structured collaboration graphs
Tool list graphs
Memory systems (conversational memory, agent memory, persistent context)
Multi-tenant workspaces
Custom resource types

We need a dynamic, extensible RBAC system that can: 1. Support new resource types without schema changes 2. Allow administrators to create custom roles 3. Enable fine-grained, scoped permissions (e.g., access to specific ontology) 4. Support role hierarchies and permission inheritance 5. Maintain backwards compatibility with existing roles

Decision

Implement a three-tier RBAC system with dynamic resource registration:

1. Resource Registry (Dynamic Resource Types)

New Table: kg_auth.resources

CREATE TABLE kg_auth.resources (
    resource_type VARCHAR(100) PRIMARY KEY,
    description TEXT,
    parent_type VARCHAR(100) REFERENCES kg_auth.resources(resource_type),
    available_actions VARCHAR(50)[],  -- ['read', 'write', 'delete', 'approve', 'execute']
    supports_scoping BOOLEAN DEFAULT FALSE,  -- Can permissions be scoped to specific instances?
    metadata JSONB,  -- Custom fields per resource type
    registered_at TIMESTAMPTZ NOT NULL DEFAULT NOW(),
    registered_by VARCHAR(100)
);

Example Resources:

resource_type         | parent_type | available_actions                         | supports_scoping
----------------------|-------------|-------------------------------------------|------------------
concepts              | NULL        | ['read', 'write', 'delete']              | FALSE
vocabulary            | NULL        | ['read', 'write', 'approve', 'delete']   | FALSE
jobs                  | NULL        | ['read', 'write', 'approve', 'delete']   | FALSE
users                 | NULL        | ['read', 'write', 'delete']              | FALSE
ontologies            | NULL        | ['read', 'write', 'delete', 'manage']    | TRUE
ontologies.ai_generated | ontologies | ['read', 'write', 'approve']            | TRUE
collaboration_graphs  | NULL        | ['read', 'write', 'invite', 'moderate']  | TRUE
tool_lists            | NULL        | ['read', 'write', 'execute', 'share']    | TRUE
workspaces            | NULL        | ['read', 'write', 'admin']               | TRUE

Note on Memory Systems: Memories are graph-native - they're concepts and edges in specialized ontologies (e.g., memory:user_123), not a separate resource type. See Use Case 4 for details.

2. Dynamic Roles

New Table: kg_auth.roles

CREATE TABLE kg_auth.roles (
    role_name VARCHAR(50) PRIMARY KEY,
    display_name VARCHAR(100) NOT NULL,
    description TEXT,
    is_builtin BOOLEAN DEFAULT FALSE,  -- System roles (cannot be deleted)
    is_active BOOLEAN DEFAULT TRUE,
    parent_role VARCHAR(50) REFERENCES kg_auth.roles(role_name),  -- Role inheritance
    created_at TIMESTAMPTZ NOT NULL DEFAULT NOW(),
    created_by INTEGER REFERENCES kg_auth.users(id),
    metadata JSONB  -- Custom fields (e.g., color, icon)
);

Builtin Roles: - read_only - Read access to public resources - contributor - Can create content - curator - Can approve and manage content - admin - Full system access

Custom Role Examples: - ontology_manager - Manages AI-generated ontologies - collaboration_lead - Moderates collaboration graphs - tool_executor - Can execute tools from tool lists - workspace_owner - Owns a specific workspace

3. Scoped Permissions

Enhanced Table: kg_auth.role_permissions

-- Drop existing and recreate with scoping support
DROP TABLE IF EXISTS kg_auth.role_permissions CASCADE;

CREATE TABLE kg_auth.role_permissions (
    id SERIAL PRIMARY KEY,
    role_name VARCHAR(50) NOT NULL REFERENCES kg_auth.roles(role_name) ON DELETE CASCADE,
    resource_type VARCHAR(100) NOT NULL REFERENCES kg_auth.resources(resource_type),
    action VARCHAR(50) NOT NULL,

    -- Scoping support (optional - NULL means applies to all instances)
    scope_type VARCHAR(50),  -- 'global', 'ontology', 'workspace', 'user', 'instance'
    scope_id VARCHAR(200),   -- Specific instance ID (e.g., ontology_name, workspace_id)
    scope_filter JSONB,      -- Complex filters (e.g., {"ontology_type": "ai_generated", "status": "active"})

    granted BOOLEAN NOT NULL DEFAULT TRUE,  -- Explicit deny support
    inherited_from VARCHAR(50) REFERENCES kg_auth.roles(role_name),  -- Track inheritance

    created_at TIMESTAMPTZ NOT NULL DEFAULT NOW(),
    created_by INTEGER REFERENCES kg_auth.users(id),

    UNIQUE(role_name, resource_type, action, scope_type, scope_id)
);

CREATE INDEX idx_role_perms_role ON kg_auth.role_permissions(role_name);
CREATE INDEX idx_role_perms_resource ON kg_auth.role_permissions(resource_type, action);
CREATE INDEX idx_role_perms_scope ON kg_auth.role_permissions(scope_type, scope_id);

Permission Examples:

-- Global: Admin can read all concepts
('admin', 'concepts', 'read', 'global', NULL, NULL, TRUE, NULL)

-- Scoped: User can manage specific ontology
('ontology_manager', 'ontologies', 'manage', 'instance', 'ml_ontology_v2', NULL, TRUE, NULL)

-- Filtered: Curator can approve AI-generated ontologies
('curator', 'ontologies', 'approve', 'filter', NULL, '{"type": "ai_generated"}', TRUE, NULL)

-- Inherited: Custom role inherits from curator
('custom_curator', 'vocabulary', 'approve', 'global', NULL, NULL, TRUE, 'curator')

-- Explicit deny: Prevent deletion of builtin roles
('contributor', 'roles', 'delete', 'filter', NULL, '{"is_builtin": true}', FALSE, NULL)

4. User Role Assignments (Multiple Roles)

Enhanced Table: kg_auth.user_roles

CREATE TABLE kg_auth.user_roles (
    id SERIAL PRIMARY KEY,
    user_id INTEGER NOT NULL REFERENCES kg_auth.users(id) ON DELETE CASCADE,
    role_name VARCHAR(50) NOT NULL REFERENCES kg_auth.roles(role_name) ON DELETE CASCADE,

    -- Optional: Role assignment can be scoped to workspace/ontology
    scope_type VARCHAR(50),
    scope_id VARCHAR(200),

    assigned_at TIMESTAMPTZ NOT NULL DEFAULT NOW(),
    assigned_by INTEGER REFERENCES kg_auth.users(id),
    expires_at TIMESTAMPTZ,  -- Optional: time-limited roles

    UNIQUE(user_id, role_name, scope_type, scope_id)
);

CREATE INDEX idx_user_roles_user ON kg_auth.user_roles(user_id);
CREATE INDEX idx_user_roles_role ON kg_auth.user_roles(role_name);
CREATE INDEX idx_user_roles_scope ON kg_auth.user_roles(scope_type, scope_id);

Update users table:

-- Keep primary_role for backwards compatibility and default permissions
ALTER TABLE kg_auth.users
    RENAME COLUMN role TO primary_role;

-- Remove CHECK constraint (roles are now dynamic)
ALTER TABLE kg_auth.users
    DROP CONSTRAINT IF EXISTS users_role_check;

5. Permission Checking Logic

Python Permission Checker:

name="__codelineno-7-1" href="#__codelineno-7-1">class PermissionChecker: def can_user(self, user_id: int, action: str, resource_type: str, resource_id: Optional[str] = None) -> bool: class="w"> """ class="sd"> Check if user has permission to perform action on resource. class="sd"> Checks in order: class="sd"> 1. Instance-scoped permissions (most specific) class="sd"> 2. Filter-scoped permissions class="sd"> 3. Global permissions class="sd"> 4. Inherited permissions from parent roles class="sd"> 5. Deny permissions (explicit denies override grants) class="sd"> """ # Get all user roles (including primary_role and assigned roles) roles = self.get_user_roles(user_id, resource_id) # Check for explicit deny first if self.has_explicit_deny(roles, resource_type, action, resource_id): return False # Check permissions in order of specificity for role in roles: # 1. Instance-scoped if resource_id and self.has_instance_permission(role, resource_type, action, resource_id): return True # 2. Filter-scoped if self.has_filter_permission(role, resource_type, action, resource_id): return True # 3. Global if self.has_global_permission(role, resource_type, action): return True # 4. Check parent roles (inheritance) if self.check_inherited_permissions(role, resource_type, action, resource_id): return True return False

FastAPI Dependency:

def require_permission(resource_type: str, action: str, resource_id: Optional[str] = None):
    """
    Dependency that checks if current user has required permission.

    Usage:
        @app.get("/ontologies/{ontology_id}")
        async def get_ontology(
            ontology_id: str,
            _: Annotated[UserInDB, Depends(require_permission("ontologies", "read", ontology_id))]
        ):
            ...
    """
    def dependency(current_user: Annotated[UserInDB, Depends(get_current_active_user)]):
        checker = PermissionChecker()
        if not checker.can_user(current_user.id, action, resource_type, resource_id):
            raise HTTPException(
                status_code=status.HTTP_403_FORBIDDEN,
                detail=f"Missing permission: {action} on {resource_type}"
            )
        return current_user
    return dependency

6. API Endpoints

Resource Management:

GET    /resources                    # List registered resource types
GET    /resources/{resource_type}    # Get resource details
POST   /resources                    # Register new resource type (admin only)
PUT    /resources/{resource_type}    # Update resource definition
DELETE /resources/{resource_type}    # Unregister resource (if no permissions)

Role Management:

GET    /roles                        # List all roles
GET    /roles/{role_name}            # Get role details with permissions
POST   /roles                        # Create new role
PUT    /roles/{role_name}            # Update role
DELETE /roles/{role_name}            # Delete role (if not builtin, no users)
GET    /roles/{role_name}/users      # List users with this role

Permission Management:

GET    /roles/{role_name}/permissions              # List role permissions
POST   /roles/{role_name}/permissions              # Grant permission
DELETE /roles/{role_name}/permissions/{perm_id}    # Revoke permission
PUT    /roles/{role_name}/permissions              # Bulk update permissions

User Role Assignment:

GET    /users/{user_id}/roles        # List user's roles
POST   /users/{user_id}/roles        # Assign role to user
DELETE /users/{user_id}/roles/{role_name}  # Remove role from user

7. CLI Commands

# Resource management
kg admin resource list
kg admin resource get <resource_type>
kg admin resource create <type> --actions read,write,delete --scoped

# Role management
kg admin role list
kg admin role get <role>
kg admin role create <name> --description "..." --inherits <parent_role>
kg admin role delete <role>
kg admin role copy <source> <new_name>

# Permission management
kg admin role permissions <role>                    # List all permissions
kg admin role grant <role> <resource> <action>      # Grant permission
kg admin role revoke <role> <resource> <action>     # Revoke permission
kg admin role grant <role> <resource> <action> --scope instance --id <resource_id>

# User role assignment
kg admin user roles <user_id>                       # List user's roles
kg admin user assign <user_id> <role>               # Assign role
kg admin user unassign <user_id> <role>             # Remove role
kg admin user assign <user_id> <role> --scope workspace --id <workspace_id>

Migration Strategy

Phase 1: Schema Migration (Backwards Compatible)

Create new tables: resources, roles, user_roles

Migrate existing data:

-- Create builtin roles
INSERT INTO kg_auth.roles (role_name, display_name, is_builtin)
VALUES
    ('read_only', 'Read Only', TRUE),
    ('contributor', 'Contributor', TRUE),
    ('curator', 'Curator', TRUE),
    ('admin', 'Administrator', TRUE);

-- Register existing resources
INSERT INTO kg_auth.resources (resource_type, available_actions)
VALUES
    ('concepts', ARRAY['read', 'write', 'delete']),
    ('vocabulary', ARRAY['read', 'write', 'approve', 'delete']),
    ('jobs', ARRAY['read', 'write', 'approve', 'delete']),
    ('users', ARRAY['read', 'write', 'delete']);

-- Migrate existing permissions to new schema
INSERT INTO kg_auth.role_permissions (role_name, resource_type, action, scope_type, granted)
SELECT role, resource, action, 'global', granted
FROM kg_auth.role_permissions_old;

-- Assign primary roles to all users
INSERT INTO kg_auth.user_roles (user_id, role_name)
SELECT id, primary_role FROM kg_auth.users;

Update permission checking to use new system
Keep users.primary_role for backwards compatibility

Phase 2: Add New Resource Types

As new features are added, register them:

# In ontology feature implementation
register_resource(
    resource_type="ontologies",
    description="AI-generated ontology management",
    available_actions=["read", "write", "delete", "manage", "approve"],
    supports_scoping=True
)

# Grant permissions to existing roles
grant_permission("curator", "ontologies", "approve", scope_type="filter",
                 scope_filter={"type": "ai_generated"})

Phase 3: Custom Roles

Allow administrators to create custom roles for specific use cases:

# Create workspace admin role
kg admin role create workspace_admin \
    --description "Workspace administrator" \
    --inherits curator

# Grant workspace-specific permissions
kg admin role grant workspace_admin workspaces admin --scope instance --id engineering_team

Benefits

Extensibility: New resource types can be added without schema changes
Flexibility: Fine-grained, scoped permissions (workspace-level, ontology-level, etc.)
Hierarchy: Role inheritance reduces permission duplication
Multi-tenancy Ready: Scoped permissions enable workspace/tenant isolation
Audit Trail: Track who granted what permission and when
Explicit Deny: Support for explicit permission denials
Time-Limited Access: Roles can expire (temporary access)
Backwards Compatible: Existing hardcoded roles continue to work

Examples

Use Case 1: AI-Generated Ontology Manager

# Register ontology resource
kg admin resource create ontologies \
    --actions read,write,delete,manage,approve \
    --scoped

# Create specialized role
kg admin role create ontology_curator \
    --description "Curates AI-generated ontologies" \
    --inherits curator

# Grant scoped permissions
kg admin role grant ontology_curator ontologies approve \
    --scope filter --filter '{"type": "ai_generated"}'

# Assign to user
kg admin user assign alice ontology_curator

Use Case 2: Collaboration Graph Moderator

# Register collaboration resource
kg admin resource create collaboration_graphs \
    --actions read,write,invite,moderate,delete \
    --scoped

# Create moderator role
kg admin role create collab_moderator \
    --description "Moderates collaboration spaces"

# Grant permissions
kg admin role grant collab_moderator collaboration_graphs moderate --scope global
kg admin role grant collab_moderator collaboration_graphs read --scope global

# Assign to specific collaboration space
kg admin user assign bob collab_moderator \
    --scope instance --id research_team_collab

Use Case 3: Tool Executor (Limited Permissions)

# Register tool list resource
kg admin resource create tool_lists \
    --actions read,execute \
    --scoped

# Create executor role (can run but not modify)
kg admin role create tool_executor \
    --description "Can execute approved tools"

kg admin role grant tool_executor tool_lists read --scope global
kg admin role grant tool_executor tool_lists execute \
    --scope filter --filter '{"approved": true}'

kg admin user assign charlie tool_executor

Use Case 4: Memory System (Graph-Native Conversational Context)

Architecture: Memories are nodes and edges in the knowledge graph, not a separate system. They live in specialized ontologies (e.g., memory:user_123, agent_context_v1) and can link to concepts in other ontologies.

# Memories use existing 'concepts' and 'ontologies' resources
# No new resource type needed - they're graph-native!

# Create memory manager role
kg admin role create memory_manager \
    --description "Manages agent memory and persistent context" \
    --inherits contributor

# Users can read/write concepts in their own memory ontology
kg admin role grant contributor concepts write \
    --scope filter --filter '{"ontology": "memory:$user_id"}'

# Memory managers can read across all memory ontologies (support/debugging)
kg admin role grant memory_manager concepts read \
    --scope filter --filter '{"ontology": "memory:*"}'

# Allow cross-ontology links (memories → other concepts)
# This enables "I remember discussing recursion" → recursion concept
kg admin role grant contributor concepts write \
    --scope filter --filter '{"source_ontology": "memory:*", "edge_type": "RELATED_TO"}'

# Curators can manage memory ontologies (cleanup, archival)
kg admin role grant curator ontologies manage \
    --scope filter --filter '{"ontology_prefix": "memory:"}'

# Example: Assign scoped memory access for specific agent workspace
kg admin user assign diana memory_manager \
    --scope instance --id memory:agent_workspace_123

Key Benefits of Graph-Native Memory: - Memories are concepts - same node/edge structure as all knowledge - Relationships are edges - uses existing vocabulary (RELATED_TO, IMPLIES, etc.) - Cross-ontology links - memories can reference concepts in other ontologies - Unified querying - traverse from memories to concepts seamlessly - Standard permissions - leverage existing concepts and ontologies resources

Example Memory Graph Structure:

(:Concept {label: "Discussion about recursion", ontology: "memory:user_123"})
  -[:OCCURRED_AT {timestamp: "2025-10-11T15:30:00"}]->
  (:Concept {label: "User mentioned Watts lecture", ontology: "memory:user_123"})
  -[:RELATED_TO]->
  (:Concept {label: "Recursive depth", ontology: "watts_lecture_ontology"})

Cold Start Initialization

The migration script includes automatic initialization with minimum viable permissions for a fresh installation:

Builtin Roles Created: - read_only - Can view concepts, vocabulary, jobs - contributor - + Can create/edit concepts and jobs - curator - + Can approve vocabulary and jobs - admin - Full system access including user/role management

Resources Registered: - concepts, vocabulary, jobs, users, roles, resources

Permissions Seeded: - All existing permissions from ADR-027 migrated automatically - Admin given full access to role/resource management - Curator given read access to roles/resources (visibility, no modification)

User Migration: - All existing users automatically get their primary_role as a user_roles assignment - Backwards compatible: users.primary_role column preserved

The system is immediately functional after migration - no manual setup required!

Security Considerations

Explicit Denies: Denies override grants (prevent privilege escalation)
Builtin Roles: Cannot be deleted (system stability)
Permission Inheritance: Clearly tracked (audit trail)
Scope Validation: Validate scope_id exists before granting permission
Rate Limiting: Limit permission check queries (cache frequently checked permissions)
Audit Logging: Log all permission grants/revokes in kg_logs.audit_trail

Performance Optimizations

Permission Cache: Cache user permissions in Redis (TTL: 5 minutes)
Materialized Views: Pre-compute effective permissions per user
Index Strategy: Index on (user_id, resource_type, action) for fast lookups
Lazy Loading: Only resolve parent role permissions when needed
Batch Checking: Check multiple permissions in single query

Future Extensions

Attribute-Based Access Control (ABAC):
Permissions based on user attributes (department, location, etc.)
Dynamic policies: "Allow if user.department == resource.owner_department"
Temporary Elevated Access:
"Break glass" emergency access with automatic audit and expiration
Permission Request Workflow:
Users can request permissions → approval flow → automatic grant
Role Recommendations:
AI suggests roles based on user activity patterns

References

NIST RBAC Standard: https://csrc.nist.gov/projects/role-based-access-control
AWS IAM Best Practices: https://docs.aws.amazon.com/IAM/latest/UserGuide/best-practices.html
OAuth 2.0 Scopes: https://oauth.net/2/scope/

Implementation Checklist

[ ] Create schema migration SQL
[ ] Implement Python permission checker
[ ] Create FastAPI endpoints (resources, roles, permissions)
[ ] Update existing endpoints to use new permission system
[ ] Create TypeScript client models
[ ] Implement CLI commands
[ ] Write migration script for existing data
[ ] Add caching layer (Redis)
[ ] Document permission model
[ ] Write integration tests