Maximal decomposition of information
Let be a relational database schema.
(the extension of the database constraint) can be regarded as a non-empty relation so we can consider its
unique prime Cartesian factorisation.
Claim:
For any schema there exists a schema
which is information equivalent to
where the cardinality of every finite prime Cartesian factor of is a prime number.
This implies the bag of integers which are the cardinalities of the prime
Cartesian factors of equals the bag of prime number factors of .
Note that the bag of prime number factors of any positive integer is uniquely determined, by
the Fundamental theorem of arithmetic [
].
represents a schema having maximal decomposition - i.e. into the most parts which can be updated
independently.
Proof
The following is an outline of a constructive existential proof which involves
relational expressions for each relvar in that are defined in terms of the
relvars of :
Let be the prime number factors of (the order doesn't matter)
Let the database values in be organised into an dimensional array of size
(again, the order doesn't matter).
For in and in ,
let denote the subset of corresponding to the set of database values in
the ith slice of the jth component of the n-dimensional array
(a slice has dimensions).
For in and in , let
where evaluates to DEE
if the dbvar for schema has value
, otherwise DUM
. See how it can be defined here.
Note that evaluates to DEE
if the value of the dbvar for is in slice , otherwise DUM
.
Let be some attribute name.
For in ,
let schema have relvar which is defined in terms of the relvars of using
the expression
REL { TUP { ( ) } }
(this expression evaluates to REL { TUP { ( ) } } where the dbvar of
is in slice )
Note that for every database value each relvar of has exactly one tuple with one
attribute. This follows because any database value (i.e. element of ) lies in
exactly one slice of the jth coordinate. The UNION over the JOINs picks out
exactly one of the TUP { ( ) }. There are slices for the jth component
and hence possible values of .
It is easy to see that a combination of the values of the corresponds to
one of the elements of the n-dimensional array and hence to one of the database values
in .
Example 1
Let there be three lights coloured red, green and blue which can be on or off.
There are world situations which need to be recorded.
Let schema have a single relvar with the predicate:
The light with colour [COLOUR] is on.
Let schema have three relvars with the following predicates:
- The red light is on.
- The green light is on.
- The blue light is on.
and are information equivalent. is an example of a schema with maximal decomposition.
Example 2
Let there be three lights coloured red, green and blue which can be on or off,
except they cannot be all off or all on.
There are world situations which need to be recorded.
The prime factorisation of is .
A maximally decomposed schema can be achieved with the following two predicates:
- A single light is on.
- The light with colour [COLOUR] is the only light in its state.
Example 3
See
maximal decompositon of two monadic relations with an IND constraint.