9.3 敏感信息保护
9.3.1 敏感信息类型
类型
示例
风险
9.3.2 输入侧脱敏
class InputSanitizer:
def __init__(self):
self.patterns = {
'email': r'\b[\w.-]+@[\w.-]+\.\w+\b',
'phone': r'\b\d{11}\b|\b\d{3}-\d{4}-\d{4}\b',
'id_card': r'\b(?:\d{18}|\d{17}[0-9Xx])\b',
'credit_card': r'\b\d{4}[\s-]?\d{4}[\s-]?\d{4}[\s-]?\d{4}\b',
}
def sanitize(self, text: str) -> tuple[str, dict]:
mappings = {}
sanitized = text
for pii_type, pattern in self.patterns.items():
matches = re.findall(pattern, text)
for i, match in enumerate(matches):
placeholder = f"[{pii_type.upper()}_{i}]"
mappings[placeholder] = match
sanitized = sanitized.replace(match, placeholder)
return sanitized, mappings
def restore(self, text: str, mappings: dict) -> str:
restored = text
for placeholder, original in mappings.items():
restored = restored.replace(placeholder, original)
return restored9.3.3 输出侧过滤
9.3.4 系统提示保护
9.3.5 数据分级保护
级别
数据类型
保护措施
9.3.6 数据泄露检测
9.3.7 审计与合规
要求
实现
9.3.8 敏感信息防护的可测指标
指标维度
业务价值与衡量标准
统计方式示例
9.3.9 开源工具推荐
工具
核心能力
适用场景
最后更新于
