Chaitin, Wolfram, Godel, Taleb, data reducibility, linguistics, Semantic Web, Black Swans, J curve, Theory of U + Twain’s context paradigms to reach W (Web) equilibrium
Maybe I could have entitled it: “Twaining a Conscious Web”
I decided to write this post because of the passionate debates a handful of us have been having about AI, NLP and resolving the reducibility question. At the surface, this may seem strange or over theoretical but at its core it goes into how much more powerful and intelligent the machines we build can be and also whether there is such a thing as data consciousness, context and the ability to compute every theory.
If any readers think that 50-100 comments on a thread is substantial (and whenever I’m on a thread, they do seem to attract more comments from others — it’s quite bizarre how interested people become and want to actively contribute which is good), this seriously smart debate is a 530+ strong comments thread!!! So…………..at some point we’ll be able to solve the question of how the Universe was created — LOL.
Essentially, there are two key players whose work is being debated:
* Gregory Chaitin — data is not reducible and there are some things we just can’t know.
* Stephen Wolfram (of Wolfram Alpha) — data is reducible and we can compute the answer to every question.
Computer scientists are currently trying to break data down into forms and associations that can be more readily computed, connected and extracted. It’s well known that in NLP, meaning extraction is still proving to be problematic. Additionally, we have to factor in Nicholas Taleb’s “Black Swan” type probability anomalies if we’re going to be able to compute everything, apparently.
[Twain's observation: we can't currently compute the answer to every question --- for example, how much and why we love our parents, the existence of a Supreme Being or anything involving subjectivity that's culture affiliated --- and what we should be doing is innovating algorithms to contextualize as much of the data points as possible. Moreover, data is reducible and can be transformed for smarter extraction in ways we haven't explored yet.]
As well as the Chaitin and Wolfram, we also have to throw Godel’s two Incompleteness Theorems — which work their way through mathematical logic like so:
After all that (con)textual mathematics, here are the key Chaitin videos which encapsulate his position on “maximum unknowns”.
Now here’s Stephen Wolfram explaining the computability of everything:
So then how does social science and psychology strand into what is high-end mathematics? Well……….
Those of us who’ve studied macroeconomics are aware of the J curve theory from John Maynard Keynes:
A country’s trade deficit will worsen initially after the depreciation of its currency because higher prices on foreign imports will be greater than the reduced volume of imports.
The J curve theory has been adapted by management consultants like Gartner into a theory about technology hype cycles:
Meanwhile, Otto Scharmer in organizational behavior has proposed a different letter from J, U, to explain how we examine ourselves, our perspectives on the world and the way in which we solve problems:
Since both camps (vectorial scale algorithms approach to data reducibility versus Quantum Mechanics approach) have merits, I wrote:
This thread is observing a classic W approach to problem-solving: two schools of thought, approaching from either end, drilling down and shifting their vectorial positions as time elapses and finally inflecting upwards (with views on what their prior slopes looked like) until the two schools converge and are on a different plane from where each and both started.
Wrt whether the Semantic Web can become an inference engine………….not if it continues to deploy the taxonomies and categorizations it does because that still roots us in probabilities, correlations and the other facets of it being actually no fundamentally different from Google (which itself is a difference engine just as Babbage postulated — albeit instead of absolute real numbers and binaries, it’s about the difference between statistical points).
Now, if it was a …..true DIFFERENTIATION ENGINE, this would be a real leap forward rather than an imaginary one.
We cannot infer until we can differentiate and the Semantic Web cannot do this (yet).
Specifically on data reducibility and context, this is what I offered to the debate:
The dimensions of context for each of us is personal, experiential, spiritual / emotive and cultural.
Conversely, the dimensions of computing are impersonal, iterative, rational and culture agnostic.
Explications which seem clear, obvious and even underpinned by established/irrefutable science and mathematical equations are fine to follow if we’re conversing with another person schooled with the same scientific reference points as us. They’re not so clear, obvious and irrefutable when we’re conversing with a lawyer, a photographer or a linguist because their context points for deduction, perspective and language will be distributively different.
This is why when we surface a piece of raw data in a search engine list – suppose something as simple as the number 2 – the context of it is going to be interpreted in diverse ways. The literally-minded will perceive it just as the number after 1 with a value of 2. The mathematically-minded will think of it as being a prime as well as in terms of power series, halves and double integrals. The literately-minded will automatically associate it with ‘Tale of Two Cities’, “To be or not to be”, JRR Tolkien (‘Two Towers’) and “it takes two to tango”. The artistically-minded will see the image of a swan or one half of a heart because that’s the shape of a 2. The Spanish-speaking computer scientist will think of it as being DOS (Disk Operating System). The classical scholar will reference it to Janus, the god of two faces. The romantic would tie it with coupledom…..etcetcetc………Whilst the Chinese would word associate it with the homophone for “fish”, “happiness” and “prosperity” all intrinsically bound to each other.
So that’s an example of raw data carrying implicit context which is not currently being included in or extracted explicitly by algorithms.
How to resolve this so we can compute this context and not only the raw data?
I’m working on it, as they say.
And so……………..I am………………
Tags: Black Swan events, computing theory of everything, Godel's Incompleteness Theorem, Gregory Chaitin, J curve theory macroeconomics, Kurt Godel, Nicholas Taleb, Otto Scharmer Theory of U, Semantic Web, Stephen Wolfram, Twain's context paradigms, Twain's W (Web) equilibrium, Wolfram Alpha








