Thursday, September 14, 2017

Independence Criteria for Implementations

As explained in the previous post Counting Implementations: The Problem of Size, the measure of a conscious computation within a system will be assumed to be proportional to the number of independent implementations of that computation. Independence Criteria for Implementations (ICI) must be chosen appropriately, and all possible mappings must be considered so as to maximize the number of simultaneous independent implementations for all conscious computations.

Intuitively as well as based on human observations, measure should not be a super-strong function of brain size or flexibility, so criteria that lead to exponential dependence on size are unlikely to be correct. In particular, mappings that include a functional channel plus an arbitrary combination of additional channels (any of which could have been a functional one given different initial conditions) would not be independent (intuitively) as the functional channel does all of the explanatory work about what computations are occurring, and as (problematically) the number of possible combinations grows exponentially with the number of channels. However, the Born Rule suggests that the size (squared amplitude) of terms in the wavefunction is indeed a factor, either directly or indirectly.


ICI proposal 1: Substate-style Criteria

For substates within a computation, the criterion for their independence from each other is roughly speaking that each substate should depend (in whole or in part) on a different physical variable (which may be either actual variables much like particle positions in classical configuration space, or an index which a group of variables depends on much like the quantum wavefunction depends on directions in particle-configuration space). Could a similar idea hold for independence of implementations?

Consider a digital variable that can take on 4 possible values. It can correspond to a pair of bits with possible values 00, 01, 10, 11. Within a computation, these can not be considered as independent substate bits because allowing that sort of thing would open the door to clock-and-dial-style false implementations. But for two different implementations (whether of the same computation or of different ones), each of which is by itself an allowed implementation, there seems to be no problem with allowing each of the above bits to play a role in a different one of the implementations. So while it is possible to use similar independence criteria across implementations as within them, that seems unnecessarily restrictive, and I do not find it intuitively appealing.

But that's not the only problem. Consider a system with N channels each of which can perform the same kinds of computations, similar to the earlier example of the Problem of Size, but for this example, suppose that it is a system with linear dynamics. It could for example be a quantum system, although any linear system would do.

For the initial conditions suppose that channel #1 is set up to perform a computation of interest, while the other channels are in their static conditions. For a quantum system, that means that the wavefunction is zero in the other channels, and remains so.

Consider mappings which are based on sums over various combinations of the channels. For example, a mapping M1 maps values of sums in corresponding substates of channels #1, 3, 5, and 8 to computational states. Another mapping M2 uses different channels, such as #1, 2, 3, 4, and 5.

How many implementations of the computation are performed by this system? If the Born Rule can be derived for this system, that depends on the amplitude of the wavefunction, but that's not what I want to focus on yet. In any case, the substate-style criterion leaves no direct role for the amplitude, though see below for how it might play an indirect role based on overcoming 'noise' in a many-particle world.

The question now is: How does the number of implementations depend on the number of channels, N? Intuitively, because there is no activity in channels other than #1, the number of channels should not affect the number of implementations. But mappings that combine different channels linearly, such as M1 and M2 do, satisfy the substate-style independence criteria. If each new mapping can either include or reject each channel (other than channel #1, which must be included in order to implement the computation), there are 2^(N-1) such mappings. This is actually an overestimate, because for any pair of mappings there must be at least one channel that the first includes but the second does not, and also vice versa. For example, the mapping that includes all odd # channels is not independent of the one that includes all channels. Still, this is not the proper dependence on N, which should be none.

It can be concluded that the substate-style criteria are not appropriate.


ICI proposal 2: Full Independence

An alternative is to allow any set of mappings such that being given each implementation mapping and the corresponding computational state places no restrictions on what computational states any of the other mappings correspond to. For the case of the classical ball system considered for the Problem of Size this rules out the problematic arbitrary combinations of channels, because all of the mappings using the same functional channel would have to be in the same computational state (or in one which can be mapped to it using similar substates and transition rules). It also allows the digital variable with 4 possible values to play a role in two different implementations.

For the linear system with N channels, this alone is not sufficient, because the same problem that the substate-style criteria encountered applies here. An added restriction can be placed as follows:

Change in Private:
If there is a region either of physical variables, physical indices, or physical values which is not shared by the mappings:
Both for initial and subsequent time steps, a difference in any computational sub-state that the mapping uses the non-shared region to determine must require a corresponding difference in the non-shared region, so that if there is no such difference there, no computational state is mapped to. For a given mapping, the shared region may or may not have to change also (not restricted by this condition).

This solves the N-channel problem because the non-shared channels don't meet the above conditions if the mappings give computational states that are determined by the state of channel #1 when the other channels remain at zero wavefunction.

The reference to a nonshared region of "physical values" for a particular physical variable is something that wouldn't make sense for substate independence within a single mapping, since all substates must be mapped to simultaneously within a single mapping. For different implementations, there is no such restriction.

What does it imply for quantum mechanics? That question will be studied in more detail in later posts, but note that scaling up wavefunction amplitudes does not appear to open up any possibilities for additional implementations under these criteria, for either of the above ICI proposals. As will be explored, the only way to produce the appearance of the Born Rule under such circumstances would be if a fixed 'noise level' in the wavefunction were somehow competing with the 'signal' and if increasing the amplitude allows more implementations indirectly because it allows a smaller volume of configuration space to support an implementation capable of beating the noise. This idea is interesting but has its own problems.


ICI proposal 3: Partial Independence (Note: This is a work in progress)

Knowing that the Born Rule is the goal, perhaps independence criteria could be found that satisfy the restrictions on independence suggested by the Problem of Size, while also opening the door to a more direct relationship between wavefunction amplitude and number of implementations. Increased complexity would argue against it, but if the problems of the competing-noise approach can be avoided that would be a point in its favor. Can such criteria be formulated, and if so, are they at all an intuitively reasonable possibility?

Wednesday, August 30, 2017

Counting implementations: The Problem of Size

Counting implementations: The Problem of Size

In order to get predictions out of a computationalist model, several ingredients are required:
1) Some way of determining which computations are implemented;
2) some way of determining which ones give rise to consciousness;
3) some prediction (which need not be exhaustive) about what such a consciousness would observe;
4) and some way of determining the effective probability of different types of such observations.

The implementation problem has already been addressed in previous posts on this blog. Problems 2) and 3) are hard in principle, but for present purposes, it should suffice to assume that the human brain performs the appropriate computations (whether analog, digital, or mixed) to give rise to human-like observations, and that systems that are not brain-like (or AI-like) give rise to no observations. Going beyond that assumption is not something that I will pursue, mainly because I have no way to do so.

The effective probability question is quantitative and the answer to it could have important implications. To start with, it would have to be the link between computationalism and the Born Rule of quantum mechanics, and if the Born Rule were shown to be incompatible with the effective probabilities of computationalism, then the model must be discarded (either by modifying the assumed physics or by rejecting computationalism). In practice, it may be enough to find a formula such that that the computationalist probabilities need not be incorrect, but it would be better to derive them on other grounds.

A formula for determining effective probabilities - which is to say, relative amounts of consciousness - could have other implications. For example, certain types of brain structures might have more consciousness than others, and if we know that they do, an argument could be made that those brains ought to receive more privileges. This is especially relevant to the prospects of humans peacefully coexisting with potentially conscious AIs. This could be a dangerous thing to study, but a sufficiently intelligent AI could probably study the question on its own, so for us to study it before AIs even exist might be a good idea.

The effective probability of an observation is the fraction of consciousness that makes that observation. The amount of consciousness will be assumed to be proportional to the "number of independent implementations" of each type of conscious computation. Criteria for "independence" in this context must be chosen for this purpose; at this point they may or may not be similar to the criteria for substate independence within a computation, although such similarity is certainly a hypothesis to explore.

In principle, the amount of consciousness might also depend on other characteristics of a computation, such as its complexity.

In the case of a system with continuous variables and/or infinite extent, there might be an infinite variety of independent implementation mappings, and in such a case a "correct" regularization must be used to find a ratio of finite numbers and then a limit taken towards infinity.

If there are were distinct systems each of which implements one computation, we need only count the systems of each type - in other words, just do a head count for each type of observation.

But in general, every system implements many computations. It is quite possible for one system to implement the same computation many times over - and indeed, that is what must happen if the MWI of QM is true. Even in classical mechanics though, it could certainly happen.

In there any sense in which the size of a computer affects the number of implementations which it performs? One might think so because a larger system could provide more ways to make mappings. On the other hand, it is not generally believed that the size of a computer would affect the amount of consciousness it gives rise to, and if it did it would have strange implications. Yet also, in MWI QM the "size" (in the sense of squared amplitude) of each branch of the wavefunction must indeed affect the effective probabilities if that model is correct.

Consider a simple computer that operates using the collisions of balls to create logic gates. There may be several parallel channels though which the balls can pass.

A variety of mappings could be made by including or not including each channel, with the one actually used always included. The number of such mappings would grow exponentially with the number of channels. Clearly such a strong dependence on size is problematic. It makes sense that these mappings would not be "independent" in the appropriate way, and so this exponential growth of measure can be ruled out. The exact criteria for such independence are still to be determined, but should reproduce that result.

What about the size of the ball itself? A mapping can be made for each small part of the ball to an appropriate computational substate, and these can be combined to form an (exponential) multitude of overall implementation mappings, each of which will have the correct causal behavior given appropriate restrictions. But these all must "implement" the same computation at the same time; any difference would fragment the balls and ruin the causal behaviors. These mappings, too, should not be considered independent.

Similarly, consider the light that reflects off of the ball. It will be correlated with the position of the ball, so it provides an alternative thing to make an implementation mapping with - at least for final computational states. But implementations based on these mappings will not be independent of each other or of those based on mappings from the ball itself.

Next, I'll consider some possible Independence Criteria for Implementations (ICI) that meet these restrictions, and the implications of each of these flavors of ICI.

Featured Post

Why MWI?

Before getting into the details of the problems facing the Many-Worlds Interpretation (MWI), it's a good idea to explain why I believe t...

Followers