evaulate lazy loader's "reset" of a given collection; usually not necessary, plus can step on an eager loader

Under certain circumstances the above error can occur. It's a bit hard to explain, but I will try.

In our case:

An entity joins to itself in a parent/child relation
An instance has its parent relation set to itself
A collection in the parent entity is eagerly loaded as part of a query
The properties of the entity are ordered such that the collection is loaded after the parent

This triggers a sequence of events in which:

The child is loaded
The parent is loaded as part of loading properties, the parent is set as the same object as the child because they are the same entity (same id)
The parent initializes the collection for eager loading, this updates different things in a few places
The child continues setting up its properties, setting the same collection up for lazy loading, but leaving an indirect weak reference to the collection in context.attributes that the eager loader set up
The collection set up in the parent to collect the eagerly loaded entities is garbage collected because it only has a weak reference to it at this point
When a member of the collection is loaded, there is a crash because the underlying collection was garbage collected

The stack trace is as follows:

Traceback (most recent call last):
  File "/home/dobes/sqlalchemy/test/orm/test_temp.py", line 64, in test_bug
    session.query(A).join(A.b).join(b_parent, b_parent.b_id == B.parent_id).join(b_parent.z).filter(BC.value>0).options(joinedload('b').joinedload('parent').joinedload('z')).all()
  File "/home/dobes/sqlalchemy/test/../lib/sqlalchemy/orm/query.py", line 2293, in all
    return list(self)
  File "/home/dobes/sqlalchemy/test/../lib/sqlalchemy/orm/loading.py", line 72, in instances
    rows = [process[0](row, None) for row in fetch]
  File "/home/dobes/sqlalchemy/test/../lib/sqlalchemy/orm/loading.py", line 452, in _instance
    populate_state(state, dict_, row, isnew, only_load_props)
  File "/home/dobes/sqlalchemy/test/../lib/sqlalchemy/orm/loading.py", line 305, in populate_state
    populator(state, dict_, row)
  File "/home/dobes/sqlalchemy/test/../lib/sqlalchemy/orm/strategies.py", line 1419, in load_scalar_from_joined_existing_row
    existing = _instance(row, None)
  File "/home/dobes/sqlalchemy/test/../lib/sqlalchemy/orm/loading.py", line 481, in _instance
    populate_state(state, dict_, row, isnew, attrs)
  File "/home/dobes/sqlalchemy/test/../lib/sqlalchemy/orm/loading.py", line 309, in populate_state
    populator(state, dict_, row)
  File "/home/dobes/sqlalchemy/test/../lib/sqlalchemy/orm/strategies.py", line 1419, in load_scalar_from_joined_existing_row
    existing = _instance(row, None)
  File "/home/dobes/sqlalchemy/test/../lib/sqlalchemy/orm/loading.py", line 481, in _instance
    populate_state(state, dict_, row, isnew, attrs)
  File "/home/dobes/sqlalchemy/test/../lib/sqlalchemy/orm/loading.py", line 309, in populate_state
    populator(state, dict_, row)
  File "/home/dobes/sqlalchemy/test/../lib/sqlalchemy/orm/strategies.py", line 1401, in load_collection_from_joined_existing_row
    _instance(row, result_list)
  File "/home/dobes/sqlalchemy/test/../lib/sqlalchemy/orm/loading.py", line 500, in _instance
    result.append(instance)
  File "/home/dobes/sqlalchemy/test/../lib/sqlalchemy/util/_collections.py", line 753, in append
    self._data_appender(item)
  File "/home/dobes/sqlalchemy/test/../lib/sqlalchemy/orm/collections.py", line 655, in append_without_event
    self._data()._sa_appender(item, _sa_initiator=False)
AttributeError: 'NoneType' object has no attribute '_sa_appender'

I've attached a test case that reproduces the issue. Note that because the order of properties is significant, I did a bit of a hack to sort the properties.

Workaround:

What seems to be working for me is to specify eager loading for both collections - the one on the parent and the one on the child. For example to modify the query options in the attached test case like:

        res = (session.query(A)
               .join(A.b)
               .join(b_parent, b_parent.b_id == B.parent_id)
               .join(b_parent.z).filter(BC.value>0)
               .options(joinedload('b').joinedload('z'))
               .options(joinedload('b').joinedload('parent').joinedload('z')).all()
        )

will not crash, it seems.

Comments (14)