ExMachina测试数据构建中关联参数覆盖问题的分析与解决

2025-07-05 00:12:27作者：何举烈Damon

问题背景

在使用ExMachina进行Elixir项目测试时，开发人员经常需要构建包含关联关系的测试数据。然而，当前版本的ExMachina在处理params_with_assocs/2函数时存在一个潜在问题：当用户显式指定关联ID时，该函数仍然会自动生成关联数据并覆盖用户提供的值。

问题现象

考虑以下测试用例：

attrs = params_with_assocs(:hyperlink, source_id: "invalid")
IO.inspect(attrs)
changeset = Hyperlink.changeset(%Hyperlink{}, attrs)
refute changeset.valid?

开发者期望source_id保持为"invalid"以测试无效ID的情况，但实际上输出显示该值被自动生成的哈希ID覆盖：

%{
  source_id: "775adc233d133ff96f25a64934fecc07e33055f1679bda74cd7c412c9c989b15",
  target_id: "622a55ea583fb59a5f9547171d821ad2a30f63d1911d598b21f270fa84d126f2",
  anchor_text: "A test anchor text"
}

技术分析

ExMachina内部机制

ExMachina的params_with_assocs/2函数工作流程如下：

调用build/2构建基础数据结构
通过insert_belongs_to_assocs/2处理所有belongs_to关联
最后使用recursively_strip/1清理数据

问题出在第二步，当前实现会无条件地为所有belongs_to关联创建关联记录，而忽略了用户可能已经显式提供的关联ID。

关联处理逻辑

在Ecto中，belongs_to关联通常通过外键字段(owner_key)表示。当前ExMachina的实现没有检查这些字段是否已存在于用户提供的属性中，导致用户指定的值被覆盖。

解决方案

改进思路

核心改进点是在处理belongs_to关联前，先检查用户是否已提供关联字段值。如果字段已存在，则跳过该关联的自动生成。

具体实现

修改insert_belongs_to_assocs/2函数，增加对用户提供属性的检查：

defp insert_belongs_to_assocs(%{__struct__: struct} = record, module, attrs) do
  assocations = struct.__schema__(:associations)

  Enum.reduce(assocations, record, fn association_name, record ->
    case struct.__schema__(:association, association_name) do
      association = %{__struct__: Ecto.Association.BelongsTo, owner_key: key} ->
        if Map.has_key?(attrs, key) do
          record
        else
          insert_built_belongs_to_assoc(module, association, record)
        end

      _ ->
        record
    end
  end)
end

同时需要修改params_with_assocs/3以传递attrs参数：

def params_with_assocs(module, factory_name, attrs \\ %{}) do
  factory_name
  |> module.build(attrs)
  |> insert_belongs_to_assocs(module, attrs)
  |> recursively_strip
end