使用 Dart 构建基于微内核架构的高性能可插拔 WAF 引擎

后端架构

文章字数: 3.5k

阅读时长: 15 分

一个现实的需求摆在面前：团队的技术栈全面拥抱了 Dart，从移动端（Flutter）到后端服务。然而，在安全防护层面，我们却要依赖 Nginx+Lua 或其他基于 Go/C++ 的 WAF 产品。这不仅引入了额外的技术栈维护成本，也使得 WAF 的行为与我们的核心业务逻辑在监控、部署和开发体验上产生了割裂。我们的痛点很明确：需要一个 Dart 原生的、轻量级且易于扩展的 WAF 解决方案。

最初的构想很简单，用 dart:io 的 HttpServer 实现一个反向代理，在转发请求前后插入一些规则检查逻辑。但这很快就暴露了问题：硬编码的规则逻辑会迅速变成一坨难以维护的代码。每次新增或修改一个防护策略，比如增加一个新的 SQL 注入检测规则，或是调整IP黑名单，都需要修改核心代理代码并重新部署。这种设计是脆弱且僵化的。

真正的挑战在于如何构建一个稳固的内核，它只负责高效地处理网络 I/O 和请求生命周期管理，而将所有的安全策略都抽象为外部的、可独立开发和部署的“插件”。这自然而然地导向了微内核（Microkernel）架构。

微内核与插件化架构的技术选型

选择 Dart 来构建这样一个网络密集型应用并非一时兴起。Dart 的单线程事件循环模型非常适合处理大量并发的 I/O 操作，避免了传统多线程模型下上下文切换的开销和锁竞争的复杂性。对于计算密集型任务，例如复杂的正则表达式匹配或请求体分析，Dart 的 Isolate 提供了真正的并行计算能力，可以将这些耗时操作隔离出去，避免阻塞主事件循环，这是保证 WAF 低延迟的关键。

微内核架构的核心思想是将系统功能划分为两部分：一个最小化的核心（Kernel）和一系列功能性的插件（Plugin）。

内核 (Kernel): 它的职责极其单一。
- 监听网络端口，接收 HTTP 请求。
- 管理插件的生命周期（加载、初始化、卸载）。
- 建立一个清晰的请求处理管道（Pipeline）。
- 将请求对象依次传递给管道中的插件进行处理。
- 根据插件的处理结果，决定是拦截请求、修改请求还是将其转发到上游服务器。
  . 处理底层的网络数据流。
插件 (Plugin): 每个插件都是一个独立的单元，实现一个具体的安全功能。
- IP 黑名单/白名单。
- HTTP Header 合法性检查。
- 请求速率限制。
- SQL 注入、XSS 攻击等特征码检测。

这种架构的优势是显而易见的：内核保持稳定，而安全策略可以像搭积木一样灵活组合和热插拔。开发新规则只需要实现一个新的插件，无需触碰核心代码。

核心引擎的实现

我们从最基础的反向代理服务器开始。这里的关键点在于必须以流（Stream）的方式处理请求和响应体，而不是一次性读入内存。一个常见的错误是先将整个请求体读完再做处理，这在面对大文件上传等场景时会直接耗尽服务器内存。

// file: lib/core/proxy_server.dart

import 'dart:io';
import 'package:logging/logging.dart';
import 'plugin_manager.dart';
import 'context.dart';

class WafServer {
  final _logger = Logger('WafServer');
  final PluginManager _pluginManager;
  final String _upstreamHost;
  final int _upstreamPort;
  
  late HttpServer _server;

  WafServer(this._pluginManager, this._upstreamHost, this._upstreamPort);

  Future<void> start(String host, int port) async {
    try {
      _server = await HttpServer.bind(host, port);
      _logger.info('WAF Server started on $host:$port');
      _logger.info('Upstream server: $_upstreamHost:$_upstreamPort');
      
      await for (var request in _server) {
        // 使用 Future.microtask 确保处理过程不会阻塞 HttpServer 的 accept 循环
        Future.microtask(() => _handleRequest(request));
      }
    } catch (e, s) {
      _logger.severe('Failed to start server', e, s);
      exit(1);
    }
  }

  Future<void> stop() async {
    await _server.close(force: true);
    _logger.info('WAF Server stopped.');
  }

  Future<void> _handleRequest(HttpRequest request) async {
    final context = RequestContext(request);

    // 1. 请求阶段插件处理
    final decision = await _pluginManager.processRequest(context);

    if (decision.action == RuleAction.block) {
      _logger.warning('Request blocked by plugin: ${decision.pluginName}. Reason: ${decision.message}');
      request.response.statusCode = HttpStatus.forbidden;
      request.response.write('Forbidden by WAF.');
      await request.response.close();
      return;
    }

    // 2. 转发请求到上游
    final client = HttpClient();
    try {
      final upstreamUri = Uri(
        scheme: 'http',
        host: _upstreamHost,
        port: _upstreamPort,
        path: request.uri.path,
        query: request.uri.query,
      );

      final clientRequest = await client.openUrl(request.method, upstreamUri);

      // 复制 headers, context.headers 可能被插件修改过
      context.headers.forEach((name, values) {
        // Host header 需要特殊处理
        if (name.toLowerCase() != 'host') {
          clientRequest.headers.set(name, values);
        }
      });
      clientRequest.headers.host = _upstreamHost;

      // 以流的方式 pipe 请求体，这是性能关键
      await request.pipe(clientRequest);

      final clientResponse = await clientRequest.close();
      
      // 3. 响应阶段插件处理 (此示例为简化，暂未实现响应阶段插件)
      
      // 复制上游响应到客户端
      request.response.statusCode = clientResponse.statusCode;
      clientResponse.headers.forEach((name, values) {
        request.response.headers.set(name, values);
      });
      
      // 以流的方式 pipe 响应体
      await clientResponse.pipe(request.response);

    } catch (e, s) {
      _logger.severe('Error forwarding request to upstream', e, s);
      request.response.statusCode = HttpStatus.internalServerError;
      request.response.write('Internal Server Error');
      await request.response.close();
    } finally {
      client.close();
    }
  }
}

这段代码构成了我们 WAF 的骨架。它正确地处理了网络 I/O，并且为插件管理器 _pluginManager 留出了挂载点。RequestContext 是一个关键的数据结构，它封装了原始的 HttpRequest，并为插件提供了一个可控的、安全的交互界面。

插件体系设计

为了让插件系统工作起来，我们需要定义一套契约。

Plugin 接口: 所有插件必须实现的接口。
PluginManager: 负责加载、初始化和按顺序执行插件。
Context 对象: 在内核和插件之间传递请求状态。
Decision 对象: 插件执行后返回的结果，告知内核下一步该做什么。

// file: lib/core/plugin_interface.dart

import 'context.dart';

// 插件执行结果
enum RuleAction { allow, block }

class RuleDecision {
  final RuleAction action;
  final String pluginName;
  final String message;

  RuleDecision.allow(this.pluginName) 
    : action = RuleAction.allow, message = '';

  RuleDecision.block(this.pluginName, this.message)
    : action = RuleAction.block;
}

// 所有插件必须实现的抽象基类
abstract class WafPlugin {
  // 插件的唯一标识名
  String get name;
  
  // 初始化插件，例如从文件加载规则
  Future<void> initialize(Map<String, dynamic> config);
  
  // 处理请求的核心方法
  Future<RuleDecision> processRequest(RequestContext context);
  
  // 资源清理
  Future<void> dispose();
}

PluginManager 负责从配置文件中读取插件列表并实例化它们。

// file: lib/core/plugin_manager.dart

import 'package:logging/logging.dart';
import 'package:yaml/yaml.dart';
import 'dart:io';

import '../plugins/ip_blacklist_plugin.dart';
import '../plugins/header_validator_plugin.dart';
import 'plugin_interface.dart';
import 'context.dart';

class PluginManager {
  final _logger = Logger('PluginManager');
  final List<WafPlugin> _plugins = [];

  // 简单的插件注册表，真实项目中可能使用反射或代码生成
  final Map<String, WafPlugin Function()> _pluginFactory = {
    'ip_blacklist': () => IpBlacklistPlugin(),
    'header_validator': () => HeaderValidatorPlugin(),
    // ... 在此注册更多插件
  };

  Future<void> loadPluginsFromConfig(String configPath) async {
    final configFile = File(configPath);
    if (!await configFile.exists()) {
      _logger.severe('Config file not found: $configPath');
      return;
    }

    final configContent = await configFile.readAsString();
    final config = loadYaml(configContent);
    
    final pluginConfigs = config['plugins'] as YamlList?;
    if (pluginConfigs == null) {
      _logger.warning('No plugins configured.');
      return;
    }

    for (var pluginConfig in pluginConfigs) {
      final name = pluginConfig['name'] as String;
      final enabled = pluginConfig['enabled'] as bool? ?? false;
      
      if (!enabled) continue;

      if (_pluginFactory.containsKey(name)) {
        final plugin = _pluginFactory[name]!();
        final settings = pluginConfig['settings'] as YamlMap? ?? YamlMap();
        
        // 将 YamlMap 转换为 Dart Map
        final dartSettings = settings.nodes.map(
          (k, v) => MapEntry(k.toString(), v.value)
        );

        try {
          await plugin.initialize(dartSettings);
          _plugins.add(plugin);
          _logger.info('Plugin loaded and initialized: $name');
        } catch (e, s) {
          _logger.severe('Failed to initialize plugin: $name', e, s);
        }
      } else {
        _logger.warning('Unknown plugin configured: $name');
      }
    }
  }

  Future<RuleDecision> processRequest(RequestContext context) async {
    for (final plugin in _plugins) {
      try {
        final decision = await plugin.processRequest(context);
        if (decision.action == RuleAction.block) {
          // 一旦有插件决定拦截，立即中断处理流程
          return decision;
        }
      } catch (e, s) {
        _logger.severe('Error executing plugin: ${plugin.name}', e, s);
        // 在真实项目中，这里应该有更完善的容错机制，例如暂时禁用出错的插件
        return RuleDecision.block(
          'WafCore', 
          'An internal error occurred in plugin: ${plugin.name}'
        );
      }
    }
    // 所有插件都允许通过
    return RuleDecision.allow('WafCore');
  }

  Future<void> disposeAll() async {
    for (final plugin in _plugins) {
      await plugin.dispose();
    }
  }
}

配置文件 config.yaml 的结构如下：

# file: config/config.yaml
server:
  host: 0.0.0.0
  port: 8080

upstream:
  host: 127.0.0.1
  port: 3000

plugins:
  - name: ip_blacklist
    enabled: true
    settings:
      # 在真实场景中，路径应该是绝对路径或相对于配置文件的路径
      blacklist_file: 'config/ip_blacklist.txt'
  
  - name: header_validator
    enabled: true
    settings:
      max_content_length: 1048576 # 1 MB
      banned_user_agents:
        - "BadBot/1.0"
        - "EvilScanner/2.1"
      # 一个模拟的、用于检测注入的简单正则
      # 注意：生产环境的正则需要更严谨
      sql_injection_pattern: "(union|select|insert|delete|update|drop|--|'|;)"

插件实现：一个CPU密集型任务的挑战

我们来实现一个检查 Header 的插件。它包含一个基于正则表达式的 SQL 注入模式检测。正则表达式匹配，尤其是复杂模式，是典型的 CPU 密集型任务。如果在主事件循环中执行，当并发量很高时，一个缓慢的正则就可能拖慢所有请求的处理。

这是 Isolate 发挥作用的完美场景。我们将把正则匹配的工作卸载到一个单独的 Isolate 中。

// file: lib/plugins/header_validator_plugin.dart
import 'dart:async';
import 'dart:isolate';
import 'package:logging/logging.dart';
import '../core/plugin_interface.dart';
import '../core/context.dart';

// 用于 Isolate 间通信的消息载体
class _IsolateRequest {
  final SendPort sendPort;
  final String headerValue;
  final String pattern;
  _IsolateRequest(this.sendPort, this.headerValue, this.pattern);
}

class HeaderValidatorPlugin extends WafPlugin {
  final _logger = Logger('HeaderValidatorPlugin');
  late final int _maxContentLength;
  late final List<String> _bannedUserAgents;
  late final String _sqlInjectionPattern;
  
  // Isolate 相关
  late final Isolate _regexIsolate;
  late final ReceivePort _receivePort;
  late final SendPort _sendPort;
  final Completer<SendPort> _sendPortCompleter = Completer<SendPort>();

  @override
  String get name => 'header_validator';

  @override
  Future<void> initialize(Map<String, dynamic> config) async {
    _maxContentLength = config['max_content_length'] ?? 1024 * 1024;
    _bannedUserAgents = List<String>.from(config['banned_user_agents'] ?? []);
    _sqlInjectionPattern = config['sql_injection_pattern'] ?? '';

    // 启动 Isolate
    _receivePort = ReceivePort();
    _regexIsolate = await Isolate.spawn(_regexWorker, _receivePort.sendPort);

    _receivePort.listen((message) {
      if (message is SendPort) {
        // Isolate 启动后，会首先发送自己的 SendPort
        _sendPortCompleter.complete(message);
      }
    });

    _sendPort = await _sendPortCompleter.future;
    _logger.info('Regex worker isolate started.');
  }

  // Isolate 的入口函数，必须是顶级函数或静态方法
  static void _regexWorker(SendPort mainSendPort) {
    final workerReceivePort = ReceivePort();
    mainSendPort.send(workerReceivePort.sendPort);

    workerReceivePort.listen((message) {
      if (message is _IsolateRequest) {
        final regex = RegExp(message.pattern, caseSensitive: false);
        final bool hasMatch = regex.hasMatch(message.headerValue);
        // 将匹配结果发送回主 Isolate
        message.sendPort.send(hasMatch);
      }
    });
  }

  Future<bool> _hasSqlInjection(String value) async {
    if (_sqlInjectionPattern.isEmpty) return false;

    final responsePort = ReceivePort();
    _sendPort.send(_IsolateRequest(responsePort.sendPort, value, _sqlInjectionPattern));
    
    // 异步等待 Isolate 的返回结果
    final result = await responsePort.first;
    return result as bool;
  }

  @override
  Future<RuleDecision> processRequest(RequestContext context) async {
    // 1. 检查 Content-Length
    final contentLengthStr = context.headers.value(HttpHeaders.contentLengthHeader);
    if (contentLengthStr != null) {
      final contentLength = int.tryParse(contentLengthStr);
      if (contentLength != null && contentLength > _maxContentLength) {
        return RuleDecision.block(name, 'Request body too large.');
      }
    }
    
    // 2. 检查 User-Agent
    final userAgent = context.headers.value(HttpHeaders.userAgentHeader);
    if (userAgent != null && _bannedUserAgents.contains(userAgent)) {
        return RuleDecision.block(name, 'Banned user agent.');
    }

    // 3. 检查所有 header 值是否存在 SQL 注入模式
    for (var values in context.headers.values) {
      for (var value in values) {
        if (await _hasSqlInjection(value)) {
           return RuleDecision.block(name, 'Potential SQL injection detected in header: $value');
        }
      }
    }
    
    // 也可以检查 Query 参数
    for (var value in context.request.uri.queryParameters.values) {
      if (await _hasSqlInjection(value)) {
        return RuleDecision.block(name, 'Potential SQL injection detected in query parameter: $value');
      }
    }

    return RuleDecision.allow(name);
  }

  @override
  Future<void> dispose() async {
    _receivePort.close();
    _regexIsolate.kill(priority: Isolate.immediate);
    _logger.info('Regex worker isolate stopped.');
  }
}

这里的实现细节是关键。_regexWorker 是一个独立的执行单元，它在自己的内存空间和事件循环中运行。主 Isolate 通过 SendPort 和 ReceivePort 与其通信，发送需要匹配的字符串和模式，然后异步等待匹配结果。这种方式确保了即使正则匹配需要 100 毫秒，主 Isolate 的事件循环也完全不会被阻塞，可以继续处理其他请求，从而保证了整个 WAF 服务的高吞吐量。

架构的可视化流程

整个请求处理流程可以用一个序列图清晰地表示出来。

sequenceDiagram
    participant Client
    participant WafServer as WAF Core
    participant PluginManager as Plugin Manager
    participant IPPlugin as IP Blacklist Plugin
    participant HeaderPlugin as Header Validator
    participant RegexIsolate as Regex Isolate
    participant Upstream as Upstream Server

    Client->>+WafServer: HTTP Request
    WafServer->>+PluginManager: processRequest(context)
    
    PluginManager->>+IPPlugin: processRequest(context)
    IPPlugin-->>-PluginManager: Decision: allow

    PluginManager->>+HeaderPlugin: processRequest(context)
    Note over HeaderPlugin,RegexIsolate: HeaderPlugin 将Header值发送给Isolate进行正则匹配
    HeaderPlugin->>+RegexIsolate: Post message (value, pattern)
    RegexIsolate-->>-HeaderPlugin: Post result (match: false)
    HeaderPlugin-->>-PluginManager: Decision: allow
    
    PluginManager-->>-WafServer: Final Decision: allow
    WafServer->>+Upstream: Forward Request
    Upstream-->>-WafServer: HTTP Response
    WafServer-->>-Client: Forward Response

局限性与未来迭代方向

当前这个实现只是一个原型，证明了使用 Dart 构建微内核 WAF 的可行性。在投入生产环境之前，还有很多工作要做：

状态共享与分布式限流: 我们的速率限制插件（未在文中展示）如果是基于内存的，那么它只能在单机上工作。要实现分布式限流，需要引入外部存储如 Redis，插件需要能够与这些服务通信。
请求体分析: 当前的插件主要处理 Header 和 Query。对于需要分析请求体（如 JSON/XML/multipart-form）的 WAF 规则，我们需要在不破坏流式处理的前提下，提供一个安全、高效的 Body 读取和解析机制。这可能需要一个缓冲池和更复杂的流控制逻辑。
动态插件加载: 目前插件是在启动时加载的。一个更高级的 WAF 应该支持在运行时动态加载、卸载或更新插件，而无需重启服务。这在 Dart 中可以通过 Isolate 的代码加载机制实现，但会增加架构的复杂性。
更强的容错性: 如果一个插件持续出错或性能低下，应该有机制将其自动“熔断”，暂时从处理管道中移除，并发出告警，而不是让它拖垮整个服务。
性能优化: 虽然使用了 Isolate，但 Dart FFI（Foreign Function Interface）是另一个值得探索的方向。对于那些对性能要求极致的模式匹配算法（例如 Aho-Corasick 算法），可以用 C++ 或 Rust 实现，然后通过 FFI 供 Dart 调用，这通常比纯 Dart 实现有更高的性能。

WAF Dart 架构设计微内核

构建基于 Tekton 和 XState 的分布式 Iceberg 表维护任务幂等性保障架构

2023-10-27 数据工程

Apache Iceberg 分布式锁 XState NoSQL (键值型) Tekton

本篇

使用 Dart 构建基于微内核架构的高性能可插拔 WAF 引擎

2023-10-27 后端架构

WAF Dart 架构设计微内核