Making A More Accurate And Sustainable AI Model
I had an opportunity to talk with the founders of a company called PiLogic recently about their approach to solving certain problems which they say can be solved faster and with less energy consumption than Large Language Models (LLMs). Their approach makes heavy use of exact probabilistic inference. PiLogic says that their inference engine is the most advanced in the world as benchmarked against Join Tree and other leading methods.
PiLogic is also submitting an application to join theInternational Telecommunications Union (ITU) green computing working group. They believe their methods may be useful for many general information and computing technology (ICT) applications.
This approach doesn’t require huge data sets and specialized expensive hardware such as Graphics Processing Units (GPUs). It has particular value for engineering use cases, doesn’t have hallucinations and gives results which are precise and accurate. It is currently targeted for use in aerospace and cyber security applications but the company believes that it could become a standard AI toolkit anywhere one needs answers grounded in mathematics, where mistakes are expensive, and where results need to conform to expert knowledge.
Some of the use cases are (1) autonomous systems, such as autonomous flight, (2) cybersecurity, such as Security Operations Center (SOC) flag management and automatic threat prediction and response, and (3) aerospace, such as identification and tracking by radar, and diagnosing and predicting electrical system failures on aircraft and spacecraft. The inference engine and AI tool kit can be applied to many complex problems in industries such as finance, energy, cloud and healthcare. The image below shows the PiLogic process flow including a Bayesian Network and an evidence-based inference engine.
The PiLogic engine operates on what are called Bayes Nets which possess a number of advantages over other types of models. For example, they can incorporate expert knowledge, handle limited training data, and facilitate analysis on why the model behaves as it does. One of the techniques used in the PiLogic engine generates an efficient Arithmetic Circuit (AC) from the Bayes Net. The image below shows dependencies in an AC generated from the Baynes Net.
One reason the AC is efficient is that it pushes most of the work involved in performing inference to a pre-deployment phase that only needs to run once. After deployment, the pre-deployment work can be amortized over large numbers of queries. A second reason is that post-deployment inference answers multiple queries simultaneously.
In addition to efficiency, ACs have other advantages. For example, it is possible to know precisely how much time and space is required to answer queries, and so the approach works well in the context of real-time requirements. Moreover, the AC can be embedded in many products and applications since it doesn’t require specialized hardware. These efficiency improvements also lead to energy savings for the entire inference process on an ongoing basis for end users.
In the chart at the top of the article, the “width” of the Bayesian Network, on the horizontal axis, is a reflection of how hard a network is for a conventional inference engine. Conventional inference engines run in time and space that is exponential to this width and as a consequence only work on networks having limited width, as shown below.
PiLogic says that it has found a way to break this exponential growth in calculation complexity for many problems. They do this by using structure in the problem, particularly local structure. This may be zeros or repeated values in the model that can simplify the calculations needed. As a consequence, PiLogic says that if there is sufficient local structure, they can solve problems with treewidth into the 100’s, as shown above. Note that if there is no such structure in the model, then the PiLogic engine would have the same width constraints as conventional inference engines.
Being able to deal with higher width problems makes it possible to use more robust models that can deal with problems that rarely occur in the training data. It can also allow using these models for more proactive rather than reactive applications since the model can learn from sources of knowledge other than raw historical data.
PiLogic has developed an AI modeling methodology that allows simplification of AI training using known structure in the data and the system being modeled. This allows faster training and inference where such structure exists and reduces energy consumption for many essential problems being addressed by advanced AI.