blob: 61a952141bb87371fe213a9cb7dd45f8b9d26264 [file] [log] [blame]
:orphan:
=================
Weak References
=================
:Author: John McCall
:Date: 2013-05-23
**Abstract:** This paper discusses the general concept of weak
references, including various designs in other languages, and proposes
several new core language features and a more sophisticated runtime
support feature which can be exploited in the standard library.
.. warning:: This document was used in planning Swift 1.0; it has not been kept
up to date and does not describe the current or planned behavior of Swift.
Reference Graphs
================
Let's run through some basic terminology.
Program memory may be seen abstractly as a (not necessarily connected)
directed graph where the nodes are objects (treating transient
allocations like local scopes as objects) and the edges are
references. (Multi-edges and self-edges are permitted, of course.)
We may assign a level of strength to each of these edges. We will
call the highest level *strong*; all others are some flavor of *weak*.
Every object has a *strength of reference* which arises from the
reference graph. This can be any level of strength or the special
value *unreferenced*. The strength of a path is the minimum strength
of any edge in the path. The strength of a set of paths is the
maximum strength of all the paths, unless the set is empty, in which
case it is *unreferenced*.
In general, the implementation is only outright forbidden to
deallocate an object if it is strongly referenced. However,
being somehow weakly referenced may trigger some sort of additional
guarantee; see the Language Precedents section.
In a cycle-collecting environment, certain nodes are given special
treatment as *roots*; these nodes are always strongly referenced.
Otherwise, the strength of reference for an object is the strength
of all paths from any root to the object.
In a non-cycle-collecting environment, the strength of reference for
an object is the strength of all the direct references to that
object, taken as length=1 paths. Note that this environmental
consideration becomes a language guarantee: even if the implementation
can trivially prove that an object is referenced only by itself, it
is still not permitted to deallocate the object.
It is common for certain kinds of reference to not receive a full
guarantee. For example, a strong reference from a local variable
may lose effectiveness as soon as the variable is no longer needed
(but before it formally leaves scope). This is pervasive in GC
environments but also true in e.g. ObjC ARC.
In some programs, especially event-driven programs like UIs or
servers, it can be useful to consider the *static* reference graph as
captured during a notional steady state. Local scopes may form many
references to objects in the static graph, but unless the scope
persists indefinitely or a major restructuring is triggered, these
references are likely to have no effect on the reference graph, and so
their strength of reference has no semantic importance. Understanding
this helps to explain why many environments provide no direct way to
form a local weak reference.
Language and Library Precedents
===============================
We're only going to discuss *managed* precedents here.
It is possible to create a kind of weak reference by just not
providing any special behavior to an object reference; if the
object is deallocated, the reference will dangle and uses of it
will likely crash or cause corruption. This could happen by
e.g. declining to insert reference-count manipulations for a
particular variable (in an ARC-like environment) or not mapping
a variable's frame offset as a pointer in a type map (in a
precise-GC environment). We will call this *dangling weak*.
Objective-C
-----------
All modes of Objective-C automate memory management for
synthesized properties. In GC and ARC, accessors just
use the normal semantics for the underlying ivar (plus
copy/atomic guarantees, of course). In MRC, accessors
use dangling weak semantics unless they're :code:`retain`
or :code:`copy`, in which case they maintain a +1 refcount
invariant on the referent.
In GC and ARC, variables qualified with :code:`__weak` are
immediately zeroed out when the referenced object begins
deallocation. There is no syntactic difference on use;
it's just possible that the value read will be :code:`nil`
instead of whatever was last written there, possibly causing
the loading code to crash (e.g. on an ivar access) or silently
do nothing (e.g. on a message send). There is an opt-in
warning in ARC for certain uses of values loaded from
:code:`__weak` ivars.
In GC, it is also possible to construct a dangling
weak reference by storing an object pointer into (1) unscanned
heap memory or (2) an instance variable that's not of
object-pointer type and isn't qualified with :code:`__strong`
or :code:`__weak`. Otherwise, object references are strong
(including all references from local scopes).
In ARC, it is possible to construct a dangling weak reference
by using the :code:`__unsafe_unretained` qualifier or by
bridging a pointer value to a C pointer type.
C++
---
C++ smart pointers (e.g. :code:`std::unique_ptr`) typically
permit the creation of a dangling-weak reference by
providing an accessor to get the pointer as a normal C
pointer. (Even if they didn't have :code:`get()`, you could
manually call :code:`operator->` to get the same effect.)
C++'s :code:`std::shared_ptr` permits the formation of
weak pointers (:code:`std::weak_ptr`) from shared pointers.
It is not possibly to directly use a weak pointer; it must
first be converted back to a :code:`shared_ptr`, either by
using the :code:`lock()` operation (which produces a null
pointer if the referent has been deallocated) or by directly
constructing a :code:`shared_ptr` with the :code:`weak_ptr`
(which throws an exception if the referent has been deallocated).
There is also a way to explicitly query whether a
:code:`weak_ptr` is still valid, which may be more efficient
than checking the result of the cast.
Java
----
Java does not provide any facility for dangling weak references.
The standard library does provide three levels of weak reference
(in :code:`java.lang.ref`). References cannot be re-seated
(although they can be explicitly cleared), and users must call
:code:`get()` in order to access the value, which may yield
:code:`null`.
There is a great deal of interesting discussion of these
reference classes `here <http://www.kdgregory.com/index.php?page=java.refobj>`_.
Java :code:`Reference` objects may be constructed with an
optional :code:`ReferenceQueue`; if so, then when the
object's reachability changes, the reference object will be
added to that queue. This permits data structures to clean
up after cleared soft references without needing to either
periodically scan the entire structure or be fully lazy.
Additional data may be added to the reference object by
subclassing it.
The references are presented in order of decreasing strength.
:code:`SoftReference` is a sort of quasi-strong reference
which holds onto the object until the VM begins to run out
of memory. Soft references to softly-referenced objects are
guaranteed to have been cleared before the VM can throw an
:code:`OutOfMemoryError`. The reference will be cleared
before it is added to its reference queue (and so the
reference queue cannot resurrect the object). The intent
of soft references is to enable memory-sensitive caches,
but in practice a memory-sensitive cache would probably
want to implement a more subtle replacement strategy than
"drop things at random as soon as memory runs low". A
more interesting use is a memory-guided circuit-breaker:
when building up a very large structure, hold it in a
soft reference, and if that references goes null during
construction, just bail out. But that's a pretty tricky
use-case to get right.
:code:`WeakReference` is intended for use in non-memory-sensitive
weak caches, like a uniquing cache; it persists only as long
as the referent is more strongly referenced. The reference
will be cleared before it is added to its reference queue (and
so the reference queue cannot resurrect the object).
:code:`PhantomReference` provides a way to attach extra
finalization to an object without actually using finalizers
(which have several problems, including the ability to
resurrect the object). The phantom reference *always*
presents :code:`null` as its value and is therefore itself
useless as a reference. Phantom references are enqueued
after the object is finalized and therefore at a point when
there can be no references to the object within the VM
at all. However, the object itself cannot be deallocated
until the phantom references are all cleared or themselves
deallocated, which I believe is for the convenience of native
code that may hold a dangling weak reference to the referent
(or which may be able to directly read the reference).
.NET
----
The :code:`WeakReference` class in .NET is similar to
Java's :code:`WeakReference` class in that the value
cannot be accessed directly; it must be accessed
via the :code:`Target` property, which may yield
:code:`null`. The reference may be reseated to a
different value.
Weak references may be created *long*, which permits the
target object to be finalized but not actually deallocated.
Python
------
A :code:`weakref` acts like a function object; it is created
with a particular value, which cannot be reseated. The
function will yield :code:`None` if the referent is collected.
There is library functionality to automatically proxy a value
as a weak reference. An exception is thrown if an operation
is performed on the proxy but the referent has been collected.
A :code:`weakref` may be constructed with a callback function.
The callback will be called after the weak reference is cleared;
it is, however, passed the weak ref object itself.
Ruby
----
A :code:`WeakRef` is automatically a proxy for an object.
There is a :code:`weakref_alive` method to query whether the
reference is still alive; another other operation will cause
an exception to be thrown.
Rust
----
As far as I can tell, there is nothing like a weak reference
in Rust at the moment.
A *managed pointer* (:code:`@int`) is a strong reference
subject to GC.
An *owning pointer* (:code:`~int`) is a strong reference
that cannot be cloned (copying the pointer actually copies the
underlying data).
A *borrowed pointer* (:code:`&int`) is essentially a dangling
weak reference that is subject to static restrictions which
ensure that it doesn't actually dangle. It is thus primarily
a performance optimization.
A *raw pointer* (:code:`*int`) is a dangling weak reference.
Haskell
-------
Yes, of course Haskell has weak references.
A :code:`Weak t` is an association between a hidden key
and a visible value of type :code:`t`.
:code:`doRefWeak theRef` is an :code:`IO (Maybe t)`.
A weak reference may be constructed with an optional
:code:`IO ()` which will be run when the referent is
collected. This finalizer may (somehow) refer to the key
and value without itself keeping them alive; it is also
explicitly permitted to resurrect them.
Use Cases
=========
There are many problems that are potentially addressable with
functionality like weak references. It is not at all obvious
that they should be addressed with the same language feature.
Back references
---------------
Given that Swift is not cycle-collecting, far and away the most
important use case in the static reference graph is that of the
*back-reference*: a reference *R* to an object which holds a strong
reference (possibly indirectly) to the object holding *R*. Examples
include:
- A 'previousNode' pointer in a doubly-linked list.
- A 'parent' pointer in a render tree.
- An edge in a general graph structure.
These have several properties in common:
- Using strong references would require a lot of explicit
code to tear down the reference cycles.
- These references may be accessed very frequently, so
performance is important.
- It is not always feasible to make these references valid
immediately on construction.
- Traversing a reference after the referent is deallocated is likely a
sign that something has been kept alive longer than it was meant to
be. However, programmers may reasonably differ about the correct
response to this: crashing, and therefore encouraging the programmer
to track down a root cause, or simply writing the operation to
handle both cases correctly. Ultimately, this choice comes down to
philosophy.
Caches
------
Weak caches are used in order to prevent a cache from taking
over all available memory. By being tied to the reachability
of a value, the cache prevents entries from spuriously
expiring when their values are still in active use; but by
using weak references, the cache permits the system to
deallocate values that are no longer in use.
Generally, a data structure using weak references extensively
also needs some way to receive notification that the weak
reference was collected. This is because entries in the data
structure are likely to have significant overhead even if the
value is collected. A weak data structure which receives no
notification that a reference has been invalidated must either
allow these entries to accumulate indefinitely or must
periodically scan the entire structure looking for stale entries.
A weak reference which permits immediate deallocation of its
referent when the last strong reference is dropped is
substantially less useful for the implementation of a weak
cache. It is a common access pattern (for, say, a memoizing
cache) for a value to be looked up many times in rapid
succession, but for each use to be temporarily disjoint
from the others. A naive use of weak references in this case
will simply cause the cache to thrash. This problem is less
likely to arise in an environment with nondeterministic
collection because the entry is likely to service multiple
lookups between collections.
It is likely that users implementing weak data structures
would prefer a highly flexible infrastructure centered around
resurrection and notifications of reaching a zero refcount
than a more rigid system built directly into the language.
Since the Swift model is built around statically-inserted
operations rather than a memory scanner, this is much more
workable.
External Finalization
---------------------
Finalization models built around calling a method on the
finalized object (such as Objective-C's :code:`-dealloc`)
suffer from a number of limitations and problems:
- Since the method receives a pointer to the object being
deallocated, the implementation must guard against
attempts to resurrect the object. This may complicate
and/or slow down the system's basic reference-management
logic, which tends to be quite important for performance.
- Since the method receives a pointer to the object being
deallocated, the implementation must leave the object at
least a minimally valid state until the user code is
complete. For example, the instance variables of a
subclass cannot be destroyed until a later phase of
destruction, because a superclass finalizer might invoke
subclass behavior. (This assumes that the dynamic type
of the object does not change during destruction, which
is an alternative that brings its own problems.)
- Finalization code must be inherent to the object; other
objects cannot request that code be run when the object
is deallocated. For example, an object that registers
itself to observe a certain event source must explicitly
deregister itself in a finalizer; the event source cannot
simply automatically drop the object when it is
deallocated.
Optimization
------------
Functions often create a large number of temporary references. In a
reference-counting environment like Swift, these references require
the implementation to implicitly perform operations to increment and
decrement the reference count. These operations can be quite fast,
but they are not free, and our experience has been that the
accumulated cost can be quite significant. A straightforward local
static analysis can eliminate many operations, but many others will be
blocked by abstraction barriers, chiefly dynamically-dispatched calls.
Therefore, if Swift is to allow precise performance control, it is
important to be able to allow motivated users to selectively control
the emission of reference-counting operations.
This sort of control necessarily permits the creation of dangling weak
references and so is not safe.
Proposal Overview
=================
Looking at these use-cases, there are two main thrusts:
- There is a general need to set up back references to objects.
These references must be designed for convenient use by non-expert
users.
- There are a number of more sophisticated use cases which require
notification or interruption of deallocation; these can be used in
the implementation of higher-level abstractions like weak caches.
Here it is reasonable to expect more user expertise, such that
power and flexibility should take priority over ease of use.
The second set of use cases should be addressed by library types working
on top of basic runtime support.
The first set of use cases will require more direct language support.
To that end, I propose adding two new variable attributes,
:code:`@weak` and :code:`@unowned`. I also propose a small slate of
new features which directly address the problem of capturing a value
in a closure with a different strength of reference than it had in the
enclosing context.
Proposed Variable Attributes
============================
In the following discussion, a "variable-like" declaration is any
declaration which binds a name to a (possibly mutable) value of
arbitrary type. Currently this is just :code:`var`, but this proposal
also adds :code:`capture`, and we may later add more variants, such as
:code:`const` or :code:`val` or the like.
:code:`@weak`
--------------
:code:`weak` is an attribute which may be applied to any
variable-like declaration of reference type :code:`T`. For
type-system purposes, the variable behaves like a normal
variable of type :code:`Optional<T>`, except:
- it does not maintain a +1 reference count invariant and
- loading from the variable after the current referent (if present)
has started destruction will result in a :code:`Nothing` value,
indistinguishable from the normal case.
The semantics are quite similar to weak references in other
environments (particularly Objective-C) except that the change in
formal type forces the user of the value to check its validity before
using it.
It doesn't really matter how the compiler would find the type
:code:`Optional<T>`; compiler-plus-stdlib magic, most likely. I do
not think the type should be :code:`Weak<T>` because that would
effectively make this a type attribute and thus make it too easy to
accidentally preserve the value as a weak reference. See the section
discussing type attributes vs. declaration attributes.
Giving the variable a consistent type of :code:`Optional<T>` permits
the user to assign :code:`Nothing` into it and therefore removes the
somewhat odd expressive possibility of a reference that can only be
missing if the object has been deallocated. I think this is an
acceptable cost of making a cleaner feature.
One alternative to using :code:`Optional<T>` would be to simply treat
the load as a potentially-failing operation, subject to the (not yet
precisely designed) language rules for error handling. This would
require the runtime to potentially synthesize an error event, which
could then propagate all the way to the end-user if the programmer
accidentally failed to check the result in a context that permitted
error propagation outwards. That's bad.
A slightly different alternative would be to treat it as a form of
error orthogonal to the standard user-error propagation. This would
be cleaner than changing the type of the variable but can't really be
designed without first having a solid error-handling design.
:code:`@unowned`
-----------------
:code:`unowned` is an attribute which may be applied to any
"variable-like" declaration of reference type :code:`T`. For
type-system purposes, the variable behaves exactly like a normal
variable of type :code:`T`, except:
- it does not maintain a +1 reference count invariant and
- loading from the variable after the referent has started
destruction causes an assertion failure.
This is a refinement of :code:`weak` focused more narrowly on the case
of a back reference with relatively tight validity invariants. This
is also the case that's potentially optimizable to use dangling weak
references; see below.
This name isn't really optimal. We've considered several different
candidates:
- :code:`weak` is a poor choice because our semantics are very
different from weak references in other environments where it's
valid to access a cleared reference. Plus, we need to expose
those semantics, so the name is claimed.
- :code:`backref` is strongly evocative of the major use case in the
static reference graph; this would encourage users to use it for
back references and to consider alternatives for other cases, both
of which I like. The latter also makes the husk-leaking
implementation (see below) more palatable. It also contrasts very
well with :code:`weak`. However, its evocativeness makes it
unwieldy to use for local reference-counting optimizations.
- :code:`dangling` is more general than :code:`backref`, but it has
such strong negative associations that it wouldn't be unreasonable
for users to assume that it's unsafe (with all the pursuant
debugging difficulties) based on the name alone. I don't think
we want to discourage a feature that can help users build tighter
invariants on their classes.
- :code:`unowned` is somewhat cleaner-looking, and it isn't as tied
to a specific use case, but it does not contrast with :code:`weak`
*at all*; only someone with considerable exposure to weak
references would understand why we named each one the way we did,
and even they are likely to roll their eyes at us. But it's okay
for a working proposal.
Asserting and Uncheckable
.........................
There should not be a way to check whether a :code:`unowned`
reference is still valid.
- An invalid back-reference is a consistency error that
we should encourage programmers to fix rather than work
around by spot-testing for validity.
- Contrariwise, a weak reference that might reasonably be
invalidated during active use should be checked for validity
at *every* use. We can provide a simple library facility
for this pattern.
- Permitting implicit operations like loads to fail in a
recoverable way may end up complicating the language model
for error-handling.
- By disallowing recovery, we create a model where the only
need to actually register the weak reference with the system
is to enable a consistency check. Users who are confident
in the correctness of their program may therefore simply
disable the consistency check without affecting the semantics
of the program. In this case, that leaves the variable a
simple dangling-weak reference.
Implementation
..............
The standard implementation for a weak reference requires the
address of the reference to be registered with the system so
that it can be cleared when the referent is finalized. This
has two problems:
- It forces the runtime to maintain a side-table mapping
objects to the list of weak references; generally this
adds an allocation per weakly-referenced object.
- It forces the representation of weak references to either
be non-address-invariant or to introduce an extra level of
indirection.
For some use cases, this may be warranted; for example, in
a weak cache it might come out in the noise. But for a simple
back-reference, these are substantial penalties.
Dave Z. has proposed instead using a weak refcount, analogous to a
strong refcount. Ownership of a weak retain can be easily transferred
between locations, and it does not require a side-table of an object's
weak references. However, it does have a very important downside:
since the system cannot clear all the references, it is impossible to
actually deallocate an object that is still weakly-referenced
(although it can be finalized). Instead, the system must wait for all
the weak references to at least be accessed. We call this "husk
leaking".
This downside could be a problem for a general weak reference.
However, it's fine for a back-reference, which we expect to typically
be short-lived after its referent is finalized.
Declaration Attribute or Type Attribute
---------------------------------------
This proposal describes :code:`weak` and :code:`unowned` as
declaration attributes, not type attributes.
As declaration attributes, :code:`@unowned` and :code:`weak` would be
permitted on any :code:`var` declaration of reference type. Their
special semantics would be a property only of the declaration; in
particular, they would not change the type (beyond the shift to
:code:`Optional<T>` for :code:`weak`) , and more generally the
type-checker would not need to know about them. The implementation
would simply use different behavior when loading or storing that
variable.
As a type attribute, :code:`weak` and :code:`@unowned` would be
permitted to appear at arbitrary nested locations in the type system,
such as tuple elements, function result types, or generic arguments.
It would be possible to have both lvalues and rvalues of so-qualified
type, and the type checker would need to introduce implicit
conversions in the right places.
These implicit conversions require some thought. Consider code like
the following::
func moveToWindow(_ newWindow : Window) {
var oldWindow = self.window // an @unowned back reference
oldWindow.hide() // might remove the UI's strong reference
oldWindow.remove(self)
newWindow.add(self)
}
It would be very unfortunate if the back-reference nature of the
:code:`window` property were somehow inherited by :code:`oldWindow`!
Something, be it a general rule on loading back-references or a
type-inference rule, must introduce an implicit conversion and cause
the :code:`unowned` qualifier to be stripped.
That rule, however, is problematic for generics. A key goal of
generics is substitutability: the semantics of generic code should
match the semantics of the code you'd get from copy-pasting the
generic code and substituting the arguments wherever they're written.
But if a generic argument can be :code:`@unowned T`, then this
won't be true; consider::
func foo<T>(x : T) {
var y = x
...
}
In substituted code, :code:`y` would have the qualifier stripped and
become a strong reference. But the generic type-checker cannot
statically recognize that this type is :code:`unowned`-qualified, so
in order to match semantics, this decision must be deferred until
runtime, and the type-checker must track the unqualified variant of
:code:`T`. This adds a great deal of complexity, both to the
implementation and to the user model, and removes many static
optimization opportunities due to potential mismatches of types.
An alternative rule would be to apply an implicit "decay" to a strong
reference only when a type is known to be a :code:`unowned` type. It
could be argued that breaking substitution is not a big deal because
other language features, like overloading, can already break it. But
an overlapping overload set with divergent behavior is a poor design
which itself violates substitution, whereas this would be a major
unexcused deviation. Furthermore, preserving the weakness of a
reference is not a safe default, because it permits the object to be
destroyed while a reference is still outstanding.
In addition, any implementation model which permits the safety checks
on :code:`unowned`\ s to be disabled will require all code to agree about
whether or not the checks are enabled. When this information is tied
to a declaration, this is easy, because declarations are ultimately
owned by a particular component, and we can ask how that component is
compiled. (And we can demand that external APIs commit to one level
of safety or the other before publishing.) The setting for a type, on
the other hand, would have to be determined by the module which "wrote
the type", but again this introduces a great deal of complexity which
one can only imagine settling on the head of some very confused user.
For all these reasons, I feel that making :code:`weak` and
:code:`unowned` type attributes would be unworkable. However, there
are still costs to making them declaration attributes:
- It forces users to use awkward workarounds if they want to
make, say, arrays of back-references.
- It makes back-references less composable by, say, preventing
them from being stored in a tuple.
- It complicates SIL and IR-gen by making the rules for manipulating a
physical variable depend on more than just the type of the variable.
- It automatically enables certain things (like passing the address of
a :code:`@unowned` variable of type :code:`T` to a :code:`inout T`
parameter) that perhaps ought to be more carefully considered.
The first two points can be partly compensated for by adding library
types to wrap a back-reference. Accessing a wrapped reference will
require extra syntax, and it runs the same risk of accidentally
preserving the weakness of a reference that I discussed above.
However, note that these problems are actually at odds: requiring
extra syntax to access the wrapped reference will leave breadcrumbs
making it clear when the change-over occurs. For more on this,
see the library section.
:code:`weak`-Capable Types
--------------------------
Swift reference types can naturally be made to support any kind of
semantics, and I'm taking it on faith that we could enhance ObjC
objects to support whatever extended semantics we want. There
are, however, certain Swift value types which have reference-like
semantics that it could be useful to extend :code:`weak` (and/or
:code:`unowned`) to:
- Being able to conveniently form an optional back-reference seems
like a core requirement. If :code:`weak` were a type attribute,
we could just expect users to write :code:`Optional<@weak T>`;
as a declaration attribute, this is substantially more difficult. I
expect this to be common enough that it'll be unreasonable to ask
users to use :code:`Optional<WeakReference<T>>`.
- Being able to form a back-reference to a slice or a string seems
substantially less important.
One complication with extending :code:`weak` to value types is that
generally the implementing type will need to be different from the
underlying value type. Probably the best solution would be to hide
the use of the implementing type from the type system outside of the
wellformedness checks for the variable; SIL-gen would lower the field
to its implementing type using the appropriate protocol conformances.
As long as we have convenient optional back-references, though, we
can avoid designing a general feature for 1.0.
Generic Weak Support
--------------------
All other uses for weak references can be glossed as desiring
some amount of additional work to occur when the strong reference
count for an object reaches zero. This necessarily entails a
global side-table of such operations, but I believe that's
acceptable as long as it's relegated to less common use-cases.
It is important that the notification mechanism not require
executing code re-entrantly during the finalization process.
I suggest adopting an interface centered around the Java
concept of a :code:`ReferenceQueue`. A reference structure
is registered with the runtime for a particular object with
a particular set of flags and an optional reference queue::
struct Reference {
void *Referent; // must be non-null upon registration
struct ReferenceQueue *Queue; // must be valid or null
size_t Reserved[2];
};
void swift_registerReference(struct Reference *reference,
size_t flags);
The user/library code is responsible for allocating these structures
and initializing the first two fields, and it may include arbitrary
fields before or after the :code:`Reference` section, but while the
reference is registered with the runtime, the entire :code:`Reference`
section becomes reserved and user/library code must not access it in
any way.
The flags include:
- A priority. Should be constrained to two or three bits. References
are processed in order of decreasing priority; as long as a
reference still exists with higher priority, references with lower
priority cannot be processed. Furthermore, as long as any reference
exists, the referent cannot be finalized.
- Whether to automatically clear the reference when processing it.
Note that a cleared reference is still considered to be
registered with the runtime.
These could be combined so that e.g. even priorities cause
an automatic clear and odd priorities do not; this would avoid some
odd effects.
The runtime may assume that explicit user operations on the same
reference will not race with each other. However, user operations on
different references to the same referent may be concurrent, either
with each other or with other refcount operations on the referent.
The operations on references are as follows::
void *swift_readReference(struct Reference *reference);
This operation atomically either produces a strong reference to the
referent of the given object or yields :code:`null` if the referent
has been finalized (or if the referent is :code:`null`). The
reference must currently be registered with the runtime::
void swift_writeReference(struct Reference *reference,
void *newReferent);
This operation changes the referent of the reference to a new object,
potentially :code:`null`. The argument is taken at +0. The reference
must currently be registered with the runtime. The reference keeps
the same flags and reference queue::
void swift_unregisterReference(struct Reference *Reference);
This operation clears a reference, removes it from its reference
queue (if it is enqueued), and unregisters it from the runtime.
The reference must currently be registered with the runtime.
I propose the following simple interface to a ReferenceQueue;
arguably, however, it ought to be a reference-counted library
type with a small amount of native implementation::
struct ReferenceQueue;
struct ReferenceQueue *swift_createReferenceQueue(void);
Allocate a new reference queue::
void swift_destroyReferenceQueue(struct ReferenceQueue *queue);
Destroy a reference queue. There must not be any references with
this queue currently registered with the runtime::
struct Reference *swift_pollReferenceQueue(struct ReferenceQueue *queue);
Proposed Rules for Captures within Closures
===========================================
When a variable from an enclosing context is referenced in a closure,
by default it is captured by reference. Semantically, this means that
the variable and the enclosing context(s) will see the variable as a
single, independent entity; modifications will be seen in all places.
In terms of the reference graph, each context holds a strong reference
to the variable itself. (In practice, most local variables captured
by reference will not require individual allocation; usually they will
be allocated as part of the closure object. But in the formalism,
they are independent objects.)
Closures therefore make it fairly easy to introduce reference cycles:
for example, an instance method might install a closure as an event
listener on a child object, and if that closure refers to
:code:`self`, a reference cycle will be formed. This is an
indisputable drawback of an environment which cannot collect reference
cycles.
Relatively few languages both support closures and use
reference-counting. I'm not aware of any that attempt a language
solution to the problem; the usual solution is to the change the
captured variable to use weak-reference semantics, usually by copying
the original into a new variable used only for this purpose. This is
annoyingly verbose, clutters up the enclosing scope, and duplicates
information across multiple variables. It's also error-prone: since
both names are in scope, it's easy to accidentally refer to the wrong
one, and explicit capture lists only help if you're willing to be very
explicit.
A better alternative (which we should implement in Objective-C as
well) is to permit closures to explicitly specify the semantics under
which a variable is captured. In small closures, it makes sense to
put this near the variable reference; in larger closures, this can
become laborious and redundant, and a different mechanism is called for.
In the following discussion, a *var-or-member expression* is an
expression which is semantically constrained to be one of:
- A reference to a local variable-like declaration from an
enclosing context.
- A member access thereof, possibly recursively.
Such expressions have two useful traits:
- They always end in an identifier which on some level meaningfully
identifies the object.
- Evaluating them is relatively likely (but not guaranteed) to not
have interesting side effects, and so we feel less bad about
apparently shifting their evaluation around.
Decorated Capture References
----------------------------
Small closures are just as likely to participate in a reference cycle,
but they suffer much more from extraneous syntax because they're more
likely to be center-embedded in interesting expressions. So if
there's anything to optimize for in the grammar, it's this.
I propose this fairly obvious syntax::
button1.setAction { unowned(self).tapOut() }
button2.setAction { if (weak(self)) { weak(self).swapIn() } }
The operand is evaluated at the time of closure formation. Since
these references can be a long way from the top of the closure, we
don't want to allow a truly arbitrary expression here, because the
order of side-effects in the surrounding context could be very
confusing. So we require the operand to be an :code:`expr-var-or-member`.
More complicated expressions really ought to be hoisted out to a
separate variable for legibility anyway.
I do believe that being able to capture the value of a property
(particularly of :code:`self`) is very important. In fact, it's
important independent of weak references. It is often possible to
avoid a reference cycle by simply capturing a specific property value
instead of the base object. Capturing by value is also an
expressivity improvement: the programmer can easily choose between
working with the property's instantaneous value (at the time the
closure is created) or its current value (at the time the closure is
invoked).
Therefore I also suggest a closely-related form of decoration::
button3.setAction { capture(self.model).addProfitStep() }
This syntax would force :code:`capture` to become a real keyword.
All of these kinds of decorated references are equivalent to adding a
so-attributed :code:`capture` declaration (see below) with a nonce
identifier to the top of the closure::
button1.setAction {
capture @unowned _V1 = self
_V1.tapOut()
}
button2.setAction {
capture @weak _V2 = self
if (_V2) { _V2.swapIn() }
}
button3.setAction {
capture _V3 = self.model
_V3.addProfitStep()
}
If the operand of a decorated capture is a local variable, then that
variable must not be the subject of an explicit :code:`capture`
declaration, and all references to that variable within the closure
must be identically decorated.
The requirement to decorate all references can add redundancy, but
only if the programmer insists on decorating references instead of
adding an explicit :code:`capture` declaration. Meanwhile, that
redundancy aids both maintainers (by preventing refactors from
accidentally removing the controlling decoration) and readers (who
would otherwise need to search the entire closure to understand how
the variable is captured).
Captures with identical forms (the same sequence of members of the
same variable) are merged to the same capture declaration. This
permits type refinement to work as expected, as seen above with the
:code:`weak` capture. It also guarantees the elimination of some
redundant computation, such as with the :code:`state` property in this
example::
resetButton.setAction {
log("resetting state to " + capture(self.state))
capture(self.model).setState(capture(self.state))
}
I don't see any immediate need for other kinds of capture decoration.
In particular, I think back references are likely to be the right kind
of weak reference here for basically every use, and I don't think that
making it easy to capture a value with, say, a zeroable weak reference
is important. This is just an intuition deepened by hallway
discussions and close examination of a great many test cases, so I
concede it may prove to be misguided, in which case it should be easy
enough to add a new kind of decoration (if we're willing to burn a
keyword on it).
:code:`capture` Declarations
----------------------------
This feature generalizes the above, removing some redundancy in large
closures and adding a small amount of expressive power.
A :code:`capture` declaration can only appear in a closure: an
anonymous closure expression or :code:`func` declaration that appears
directly within a function. (By "directly" I mean not within, say, a
local type declaration within the function). :code:`capture` will
need to at least become a context-sensitive keyword.
A closure may contain multiple :code:`capture` declarations, but they
must all appear at the very top. One reason is that they can affect
the capture semantics within the closure, so someone reading the
closure should be able to find them easily. Another reason is that
they can involve executing interesting code in the enclosing context
and so should reliably appear near the closure formation site in the
source code::
decl ::= decl-capture
decl-capture ::= 'capture' attribute-list '=' expr-var-or-member
decl-capture ::= 'capture' attribute-list decl-capture-expr-list
decl-capture-expr-list ::= expr-var-or-member
decl-capture-expr-list ::= expr-var-or-member ',' decl-capture-expr-list
Both forms of :code:`capture` declaration cause one or more fields to
be created within the closure object. At the time of creating the
closure, these fields are initialized with the result of evaluating an
expression in the enclosing context. Since the expression is
evaluated in the enclosing context, it cannot refer to "previous"
captures; otherwise we could have some awkward ambiguities. I think
we should reserve the right to not execute an initializer if the
closure will never be called; this is more important for local
:code:`func` declarations than for anonymous closure expressions.
The fields introduced by :code:`capture` declarations should be
immutable by default, but programmers should be able to write
something like :code:`capture var ...` to make them mutable. Locking
down on mutation isn't quite as important as it is with implicit
captures (where it's easy to accidentally believe you're modifying the
enclosing variable) or even explicit captures in C++11 lambdas (where
copies of the lambda object will copy the capture field and thus
produce mystifying behavior in uncareful code), but it's still a
source of easy mistakes that should require manual intervention to
enable. This all presumes that we eventually design mutability into
the language, of course.
In the pattern-initializer form, the field names are given explicitly
by the pattern. The abbreviated form borrows the name of the captured
member or local variable. In either case, names should be subject to
the usual shadowing rules, whatever they may be, with the exception
that capturing an enclosing variable with the abbreviated form is not
problematic.
Attributes on a :code:`capture` declaration affect all the fields it
declares.
Let's spell out some examples. I expect the dominant form to be a
simple identifier::
capture @unowned foo
This captures the current value of whatever :code:`foo` resolves to
(potentially a member of :code:`self`!) and binds it within the
closure as a back-reference.
Permitting the slightly more general form::
capture window.title
allows users to conveniently capture specific values without mucking
up the enclosing scope with tons of variables only needed for setting
up the closure. In particular, this makes it easy to capture specific
fields out of an enclosing :code:`self` object instead of capturing
the object itself; that, plus forcing uses of :code:`self` to be
explicit in closures, would help users to conveniently avoid a class
of inadvertent retain cycles.
I've included the general pattern-initializer form mostly for ease of
exposition. It adds no major expressivity improvements over just
creating a variable in the enclosing context. It does avoid
cluttering the enclosing scope with new variables and permits captures
to be locally renamed, and it can very convenient if introducing a new
variable in the enclosing scope would be complicated (for example, if
there were a reason to prefer using a single statement there). I
don't think it does any harm, but I wouldn't mourn it, either. I do
think that generalizing the initializer to an arbitrary expression
would be a serious mistake, because readers are naturally going to
overlook code which occurs inside the closure, and promoting side
effects there would be awful.
It would be nice to have a way to declare that a closure should not
have any implicit captures. I don't have a proposal for that right now,
but it's not important for weak references.
Nested Closures
---------------
It is important to spell out how these rules affect captures in nested
closures.
Recall that all of the above rules can be transformed into
``capture`` declarations at the beginning of a closure, and that
``capture`` declarations always introduce a by-value capture
instead of a by-reference capture.
A by-reference capture is always of either a local variable or a
``capture`` declaration. In neither case do we currently permit
such captures to "drag in" other declarations silently, to the extent
that this is detectable. This means that mutable ``capture``
declarations that are themselves captured by reference must be
separately allocated from the closure object, much like what happens
with normal locals captured by reference.
I've considered it quite a bit, and I think that a by-value capture of
a variable from a non-immediately enclosing context must be made
ill-formed. At the very least, it must be ill-formed if either the
original variable is mutable (or anything along the path is, if it
involves properties) or the capture adds ``@unowned``.
This rule will effectively force programmers to use extra variables or
``capture``\ s as a way of promising validity of the internal
captures.
The motivation for this prohibition is that the intent of such
captures is actually quite ambiguous and/or dangerous. Consider
the following code::
async { doSomething(); GUI.sync { unowned(view).fireCompleted() } }
The intent of this code is to have captured a back-reference to the
value of :code:`view`, but it could be to do so at either of two
points in time. The language must choose, and in this hypothetical it
must do so without further declaration of intent and without knowledge
of when and how many times the closures will be called.
Suppose that we capture the value at the earlier point, when we form
the outer closure. This will behave very surprisingly if :code:`view`
is in fact mutated; it may be easier to imagine this if, instead of a
simple local variable, it was instead a path like :code:`self.view`.
And it's not clear that forming a back-reference at this earlier point
is actually valid; it is easy to imagine code that would rely on the
intermediate closure holding a strong reference to the view.
Furthermore, and crucially, these issues are inextricable: we cannot
somehow keep track of the mutable variable but only hold its value
weakly.
But suppose instead that we capture the value at the later point.
Then the intermediate closure will capture the :code:`view` variable
by reference, which means that in effect it will hold :code:`view`
strongly. This may actually completely subvert the user's desired
behavior.
It's not clear to me right now whether this problem applies equally to
explicit :code:`capture` declarations. Somehow decorated expressions
seem more ambiguous in intent, probably because the syntax is more
thoughtlessly convenient. On the other hand, making the decoration
syntax not just a shorthand for the explicit declarations makes the
model more complex, and it may be over-complex already.
So in summary, it would be best to enforce a strong prohibition against
these dangerous multi-level captures. We can tell users to introduce
secondary variables when they need to do subtle things across several
closure levels.
Library Directions
==================
The library should definitely provide the following types:
- :code:`UnownedReference<T>`: a fragile value type with a single
public :code:`unowned` field of type :code:`T`. There should be an
implicit conversion *from* :code:`T` so that obvious initializations
and assignments succeed. However, there should not be an implicit
conversion *to* :code:`T`; this would be dangerous because it could
hide bugs introduced by the way that e.g. naive copies into locals
will preserve the weakness of the reference.
In keeping with our design for :code:`unowned`, I think this type
should actually be an alias to either
:code:`SafeUnownedReference<T>` or :code:`UnsafeUnownedReference<T>`
depending on the current component's build settings. The choice
would be exported in binary modules, but for cleanliness we would
also require public APIs to visibly commit to one choice or the
other.
- :code:`WeakReference<T>`: a fragile value type with a single public
:code:`weak` field of type :code:`T`. As above, there should be an
implicit conversion *from* :code:`T` but no implicit conversion to
:code:`T` (or even :code:`Optional<T>`). There is, however, no
need for safe and unsafe variants.
The library should consider providing the following types:
- A simple, memory-sensitive weak cache.
- :code:`Finalizer`: a value type which is constructed with a referent
and a :code:`() -> ()` function and which causes the function to be
run (on a well-known dispatch queue?) when the object is finalized.