【Python 进阶】抽象基类(Abstract Base Class)-EW帮帮网

`abc` 模块和抽象基类(Abstract Base Class)的概念：

本文的示例代码来源于 break-a-scene.

abc 是 Python 标准库中的一个模块，全称是 “Abstract Base Classes”（抽象基类）。
它提供了一种定义抽象类和抽象方法的机制。
在以下代码中，AttentionControl 类继承自 abc.ABC，表示这是一个抽象基类。

让我详细解释一下关键点：

抽象基类的作用：

定义了一个接口规范
强制子类必须实现某些方法
不能直接实例化，必须通过子类来实现

在代码中的具体应用：

class AttentionControl(abc.ABC):
    @abc.abstractmethod
    def forward(self, attn, is_cross: bool, place_in_unet: str):
        raise NotImplementedError

@abc.abstractmethod 装饰器标记 forward 方法为抽象方法
任何继承 AttentionControl 的子类必须实现 forward 方法
如果子类没有实现这个方法，在实例化时会报错

实际使用示例：

# 错误示例 - 会报错
controller = AttentionControl()  # 不能直接实例化抽象类

# 正确示例 - 需要实现抽象方法
class MyController(AttentionControl):
    def forward(self, attn, is_cross: bool, place_in_unet: str):
        # 实现具体的逻辑
        return attn

这种设计模式的好处是：

确保所有子类都实现了必要的方法
提供了一个统一的接口规范
方便代码的维护和扩展

在这个项目中，AttentionControl 作为抽象基类，定义了注意力控制的基本接口，而具体的实现（如 AttentionStore、AttentionReplace 等）都是它的子类，各自实现了不同的注意力控制策略。

让我解释一下这两种实例化的区别：

controller = AttentionControl() 会报错，因为：

AttentionControl 是一个抽象基类（Abstract Base Class）
它包含抽象方法 forward（被 @abc.abstractmethod 装饰）
抽象类不能直接实例化，必须通过子类来实现

self.controller = AttentionStore() 是合法的，因为：

AttentionStore 是 AttentionControl 的子类
它实现了所有必需的抽象方法（包括 forward 方法）
子类可以正常实例化

让我们看看代码中的具体实现：

# 父类（抽象基类）
class AttentionControl(abc.ABC):
    @abc.abstractmethod
    def forward(self, attn, is_cross: bool, place_in_unet: str):
        raise NotImplementedError

# 子类（具体实现）
class AttentionStore(AttentionControl):
    def forward(self, attn, is_cross: bool, place_in_unet: str):
        key = f"{place_in_unet}_{'cross' if is_cross else 'self'}"
        if attn.shape[1] <= 32**2:
            self.step_store[key].append(attn)
        return attn

这就像是一个"合同"：

AttentionControl 定义了一个"合同"，要求所有子类必须实现 forward 方法
AttentionStore 签署了这个"合同"，通过实现 forward 方法来满足要求
因此 AttentionStore 可以实例化，而 AttentionControl 不能

这种设计模式的好处是：

确保所有注意力控制器都实现了必要的方法
提供了统一的接口规范
方便代码的维护和扩展
可以在运行时检查类型是否正确

所以简单来说：

AttentionControl() 会报错，因为它是一个"合同模板"
AttentionStore() 可以正常工作，因为它是一个"已签署的合同"

继承 `abc.ABC` 的好处

让我解释一下为什么 AttentionControl 要继承 abc.ABC：

不是绝对必要的，但是继承 abc.ABC 有几个重要好处：

# 方式1：继承 abc.ABC
class AttentionControl(abc.ABC):
    @abc.abstractmethod
    def forward(self, attn, is_cross: bool, place_in_unet: str):
        raise NotImplementedError

# 方式2：不继承 abc.ABC
class AttentionControl:
    def forward(self, attn, is_cross: bool, place_in_unet: str):
        raise NotImplementedError

继承 abc.ABC 的主要好处：

a) 强制抽象方法实现：

# 如果不继承 abc.ABC，子类可以不实现 forward 方法
class BadController(AttentionControl):
    pass  # 这不会报错

# 如果继承 abc.ABC，子类必须实现 forward 方法
class GoodController(AttentionControl):
    def forward(self, attn, is_cross: bool, place_in_unet: str):
        return attn  # 必须实现

b) 提供类型检查：

# 继承 abc.ABC 后，可以用于类型检查
def process_controller(controller: AttentionControl):
    # 确保 controller 实现了所有必要的方法
    pass

c) 更好的代码文档：

明确表示这是一个抽象基类
表明这个类是用来被继承的
表明这个类定义了接口规范

实际应用中的区别：

# 不继承 abc.ABC 的问题
class AttentionControl:
    def forward(self, attn, is_cross: bool, place_in_unet: str):
        raise NotImplementedError

class BadController(AttentionControl):
    pass  # 可以实例化，但会报错

controller = BadController()  # 可以创建实例
controller.forward()  # 运行时才会报错

# 继承 abc.ABC 的好处
class AttentionControl(abc.ABC):
    @abc.abstractmethod
    def forward(self, attn, is_cross: bool, place_in_unet: str):
        raise NotImplementedError

class BadController(AttentionControl):
    pass  # 直接报错，不能实例化

controller = BadController()  # 直接报错，不能创建实例

为什么选择继承 abc.ABC：

a) 更早的错误检测：

在类定义时就能发现错误
而不是在运行时才发现

b) 更清晰的接口定义：

明确表示这是一个抽象基类
明确哪些方法必须被实现

c) 更好的代码维护：

更容易理解代码结构
更容易发现潜在问题

d) 更好的类型提示：

IDE 可以提供更好的代码补全
静态类型检查工具可以工作

所以，虽然继承 abc.ABC 不是绝对必要的，但它提供了：

更好的错误检测
更清晰的接口定义
更好的代码维护性
更好的类型提示

这些好处使得代码更加健壮和可维护，特别是在大型项目中。

继承的好处

为什么 AttentionStore 要继承 AttentionControl，以及这样做的好处：

必须这么做吗？

不是必须的，但这是一个很好的设计模式
如果不继承，也可以直接实现所有功能
但继承 AttentionControl 提供了更好的代码结构和可维护性

这样做的好处：

class AttentionControl(abc.ABC):
    def __call__(self, attn, is_cross: bool, place_in_unet: str):
        if self.cur_att_layer >= self.num_uncond_att_layers:
            h = attn.shape[0]
            attn[h // 2 :] = self.forward(attn[h // 2 :], is_cross, place_in_unet)
        self.cur_att_layer += 1
        if self.cur_att_layer == self.num_att_layers + self.num_uncond_att_layers:
            self.cur_att_layer = 0
            self.cur_step += 1
            self.between_steps()
        return attn

a) 代码复用：

AttentionControl 提供了通用的注意力控制逻辑
子类只需要实现特定的 forward 方法
避免了重复编写相同的代码

b) 统一接口：

所有注意力控制器都遵循相同的接口
确保所有控制器都有必要的方法
方便其他代码使用这些控制器

c) 扩展性：

可以轻松添加新的注意力控制器
只需要继承 AttentionControl 并实现 forward 方法
不需要重写所有逻辑

d) 维护性：

如果需要修改基础逻辑，只需要修改 AttentionControl
所有子类都会自动获得更新
减少了代码重复和潜在的错误

实际应用示例：

# 基础控制器
class AttentionControl(abc.ABC):
    @abc.abstractmethod
    def forward(self, attn, is_cross: bool, place_in_unet: str):
        raise NotImplementedError

# 存储控制器
class AttentionStore(AttentionControl):
    def forward(self, attn, is_cross: bool, place_in_unet: str):
        # 只实现存储逻辑
        key = f"{place_in_unet}_{'cross' if is_cross else 'self'}"
        if attn.shape[1] <= 32**2:
            self.step_store[key].append(attn)
        return attn

# 替换控制器
class AttentionReplace(AttentionControl):
    def forward(self, attn, is_cross: bool, place_in_unet: str):
        # 实现替换逻辑
        return torch.einsum("hpw,bwn->bhpn", attn_base, self.mapper)

这种设计模式的好处是：

代码更加模块化
更容易维护和扩展
减少了代码重复
提供了统一的接口
方便添加新的功能

所以，虽然不是必须的，但继承 AttentionControl 是一个很好的设计选择，它提供了更好的代码结构和可维护性。

`AttentionControl` 和 `AttentionStore` 的差异

基础功能差异：

# AttentionControl - 基础抽象类
class AttentionControl(abc.ABC):
    def step_callback(self, x_t):
        return x_t  # 基础实现，直接返回输入

    def between_steps(self):
        return  # 空实现

    @abc.abstractmethod
    def forward(self, attn, is_cross: bool, place_in_unet: str):
        raise NotImplementedError  # 必须由子类实现

# AttentionStore - 具体实现类
class AttentionStore(AttentionControl):
    def forward(self, attn, is_cross: bool, place_in_unet: str):
        # 具体实现：存储注意力值
        key = f"{place_in_unet}_{'cross' if is_cross else 'self'}"
        if attn.shape[1] <= 32**2:
            self.step_store[key].append(attn)
        return attn

    def between_steps(self):
        # 具体实现：合并注意力存储
        if len(self.attention_store) == 0:
            self.attention_store = self.step_store
        else:
            for key in self.attention_store:
                for i in range(len(self.attention_store[key])):
                    self.attention_store[key][i] += self.step_store[key][i]
        self.step_store = self.get_empty_store()

属性差异：

# AttentionControl 的属性
self.cur_step = 0
self.num_att_layers = -1
self.cur_att_layer = 0

# AttentionStore 额外添加的属性
self.step_store = self.get_empty_store()  # 存储当前步骤的注意力
self.attention_store = {}  # 存储累积的注意力

主要功能差异：

AttentionControl：

提供基础的注意力控制框架
管理注意力层的计数和步骤
定义抽象接口
不存储任何注意力值

AttentionStore：

实现具体的注意力存储功能
提供注意力值的累积和平均
管理注意力值的存储结构
添加了存储相关的属性和方法

新增方法：

# AttentionStore 特有的方法
@staticmethod
def get_empty_store():
    # 创建空的存储结构
    return {
        "down_cross": [], "mid_cross": [], "up_cross": [],
        "down_self": [], "mid_self": [], "up_self": []
    }

def get_average_attention(self):
    # 计算平均注意力
    return {
        key: [item / self.cur_step for item in self.attention_store[key]]
        for key in self.attention_store
    }

总结差异：

功能定位：
- AttentionControl：基础框架，定义接口
- AttentionStore：具体实现，专注于存储功能
实现程度：
- AttentionControl：抽象类，部分方法为空实现
- AttentionStore：具体类，所有方法都有完整实现
存储能力：
- AttentionControl：不存储数据
- AttentionStore：提供完整的存储和管理功能
使用场景：
- AttentionControl：作为基类，定义规范
- AttentionStore：实际使用，存储注意力值

这种设计体现了面向对象编程中的"抽象与具体"的关系，通过继承实现了代码的复用和扩展。

【Python 进阶】抽象基类(Abstract Base Class)

`abc` 模块和抽象基类(Abstract Base Class)的概念：

继承 `abc.ABC` 的好处

继承的好处

`AttentionControl` 和 `AttentionStore` 的差异

网站公告

今日签到

热门文章

最新发布

【Python 进阶】抽象基类(Abstract Base Class)

abc 模块和抽象基类(Abstract Base Class)的概念：

继承 abc.ABC 的好处

继承的好处

AttentionControl 和 AttentionStore 的差异

网站公告

今日签到

热门文章

最新发布

`abc` 模块和抽象基类(Abstract Base Class)的概念：

继承 `abc.ABC` 的好处

`AttentionControl` 和 `AttentionStore` 的差异