The Wall in the Machine: Why ARC-AGI-3 Still Stumps the World’s Smartest AI

The Wall in the Machine: Why ARC-AGI-3 Still Stumps the World’s Smartest AI

When Francois Chollet released the Abstraction and Reasoning Corpus (ARC) in 2019, it was intended as a sobering reality check for an industry intoxicated by its own hype. He proposed a series of visual puzzles that a five-year-old could solve in seconds, yet the most advanced neural networks of the time remained utterly baffled. Five years later, despite the meteoric rise of Large Language Models (LLMs) and trillions of dollars in market valuation, the latest analysis from ARC-AGI-3 reveals a stubborn truth: the machine is still hitting the same wall.

The latest benchmarks do not merely show that AI is "less smart" than humans; they pinpoint a structural failure in how synthetic intelligence processes logic. Even with the massive compute power behind the newest frontier models, three systematic reasoning errors have emerged as the "Achilles' heel" of modern artificial intelligence. These are not glitches that a simple software patch can fix. They are fundamental misalignments in the way silicon mimics thought.

Key Takeaways: The ARC-AGI-3 Revelations

The Memorization Trap: Modern models often rely on "probabilistic retrieval" rather than genuine novel reasoning.
Compositional Collapse: Systems struggle to combine two simple rules into a single complex action.
The Scale Paradox: More data and more parameters have failed to yield a significant breakthrough in fluid intelligence.
The Human Edge: Biological intelligence remains vastly superior at "System 2" thinking—the slow, deliberate logic required for unfamiliar tasks.

The Mirage of Intelligence: Pattern Matching vs. Reason
To understand the failure, one must first understand the test. ARC-AGI puzzles require a solver to look at a few examples of a grid changing color or shape and then predict the output for a new, unseen grid. There is no language to lean on, no "training data" that can perfectly predict the next pixel. It requires an internal mental model of physics, geometry, and persistence.

The first systematic error identified in the ARC-AGI-3 analysis is The Memorization Trap. LLMs are, at their core, sophisticated statistical mimics. When they encounter a problem, they aren't "thinking" in the human sense; they are calculating the mathematical likelihood of what the answer should look like based on the billions of pages they have read.

In ARC-AGI-3, researchers found that when puzzles were slightly modified to move away from common geometric patterns found on the internet, model performance plummeted. The AI wasn't reasoning through the change; it was trying to "lookup" a similar solution from its training. If the solution wasn't in the database, the machine went dark. This suggests that what we perceive as "reasoning" in chatbots is often just an incredibly high-resolution form of memory.


The Compositional Collapse: Where Logic Fragility Begins

The second error is perhaps more concerning for those hoping for Artificial General Intelligence (AGI) in the near term: Compositional Collapse.

In a typical ARC-AGI task, a user might need to identify a shape, rotate it, and then change its color based on its proximity to a border. These are three simple logical steps. While modern models can often perform any one of these tasks in isolation, they struggle immensely to chain them together accurately.

Human beings possess a "global workspace" in the brain that allows us to hold multiple rules in our head simultaneously. AI, however, suffers from a compounding error rate. If a model is $90\%$ accurate at Step A and $90\%$ at Step B, its chances of getting a five-step sequence right drop significantly. By the time it reaches the third or fourth logical "hop," the "noise" in the system outweighs the signal. This fragility is why AI can write a sonnet about a toaster but fails to solve a logic puzzle involving four colored squares.


The "Objectness" Problem: Why AI Struggles with Physicality

The third and most profound error highlighted by the ARC-AGI-3 data is a failure in Core Knowledge. Humans are born with an innate understanding of "objectness"—we know that an object exists even if it moves behind another, and we understand that shapes have boundaries.

The latest models still treat grids of data as a flat sea of tokens rather than a collection of independent objects. In the ARC-AGI-3 analysis, models frequently "hallucinated" parts of shapes or allowed objects to pass through one another in ways that violate basic spatial logic.

This is the "World Model" deficit. Because these systems are trained primarily on text, they lack the grounded experience of the physical world. They understand the word "gravity" as a linguistic concept, but they do not "understand" the downward pull of an object. When a puzzle requires an AI to "drop" a shape to the bottom of a grid, the AI often misses the mark because it doesn't truly grasp what "down" or "bottom" implies in a spatial context.


The Billion-Dollar Question: Is Scale Enough?

For years, the prevailing wisdom in Silicon Valley has been "The Scaling Hypothesis": the idea that if we just add more GPUs, more data, and more electricity, intelligence will eventually "emerge." The ARC-AGI-3 results act as a cold bucket of water on this theory.

Despite the jump from GPT-3 to the most recent iterations, the improvement on these specific reasoning tasks has been marginal compared to the massive increase in resources. We are seeing diminishing returns. We have built machines that are world-class at "System 1" thinking—the fast, intuitive, and often subconscious pattern recognition we use to recognize a face or finish a sentence. But we are nowhere near "System 2"—the slow, deliberate, and logical reasoning we use to solve a math problem or navigate an unfamiliar city.

The gap between these two systems is where the current generation of AI resides. It is a brilliant librarian that has read every book ever written but cannot figure out how to use a screwdriver if the instructions aren't written in a familiar font.


Beyond the Stochastic Parrot

If the ARC-AGI-3 analysis teaches us anything, it is that the path to true AGI likely requires a fundamental architectural shift. We may need systems that aren't just larger, but different—perhaps incorporating "neuro-symbolic" approaches that combine the pattern-matching brilliance of neural networks with the hard-coded logic of classical computing.

We must move away from the "black box" approach and toward systems that can explain their reasoning. A human solving an ARC puzzle can tell you: "I saw the red square, noticed it moved two spaces right every time, and applied that to the blue square." Current AI cannot provide this trail of logic because, quite frankly, there isn't one. There is only a probability distribution.

The stakes are higher than just winning a puzzle competition. If we intend to rely on AI for scientific discovery, legal analysis, or autonomous transit, we need systems that don't just guess correctly—we need systems that reason accurately.


The Lingering Question

As we push deeper into the decade of the algorithm, we find ourselves at a strange crossroads. We have created a mirror of human knowledge that is breathtaking in its scale, yet it remains hollow at the center.

The ARC-AGI-3 results remind us that there is a spark in human cognition—an ability to look at the entirely new and find the underlying order—that still eludes our best creations. The question is no longer how much more data we can feed the machine, but rather: Can we ever teach a machine to truly think for itself, or are we simply building a more perfect echo of ourselves?

Can Nepal not make its own vaccine?

 Can Nepal not make its own vaccine?


At that time, the demand for the vaccine against Covid was very high, while India did not have enough vaccine for its own citizens.



It is said that although there was a need for vaccine production in Nepal, policymakers ignored it and a vaccine production company should be established.


The context is during the Covid-19 pandemic. Nepal was importing 2 million Covid-19 vaccines from India. After sending the first batch of 1 million, the second 1 million vials could not arrive. The import of the vaccine to Nepal was stopped due to a court order there.


The main reason for this is that there was not enough vaccine for their country. After this, Delta, which is considered the most deadly of the Covid variants, badly affected Nepal after India. If the second 1 million vaccines had arrived at that time, perhaps many deaths could have been saved.


Similarly, the recent resurgence of Japanese encephalitis has seen an increase in the number of infected deaths in the last few years. Nepal must rely on imports for this vaccine as well. Apart from these, vaccines are considered the surest way to prevent the outbreak of many other infectious diseases.


Past and present experiences of these studies also show that as we enter the 21st century, vaccines are the only solution for new (like Covid) and previously controlled and resurgent diseases (like Japanese encephalitis).


Although some medicines are currently being produced domestically, there is no vaccine production. It seems that policymakers have almost ignored the need for vaccine production.


Is it that Nepal itself cannot produce a vaccine for use in humans? Or has the health sector not yet realized the need for it, been unable to do it, or is not interested?


There are not enough health centers or hospitals in Nepal. Moreover, health centers equipped with the necessary physical infrastructure and resources are even more limited. In such a situation, if any infection spreads or takes the form of an epidemic, there will be a shortage of hospital beds, resources, and health workers.


This increases the risk of patient death. When I was working at Teku Hospital two and a half decades ago, dozens of patients with diarrhea and cholera were admitted every hour during the rainy season.


Similarly, Japanese encephalitis has a high mortality rate, so dozens of people were admitted every week during the mosquito season. Most of those admitted had to die because they arrived at the hospital late.


Generally, the impact of Japanese encephalitis is greater in the Terai. Due to the lack of sufficient hospital beds and manpower there, they were forced to come to Kathmandu.


But later, after the vaccine against Japanese encephalitis was used in Nepal, the mortality rate decreased sharply. At that time, support for the vaccine came from neighboring China. The support itself is not bad, but the question is for how long?


Nepal had expected foreign support for the vaccine during the Covid epidemic. At Nepal's request, India agreed to provide the Indian-made vaccine called Covoshield. However, when the second batch was to be sent, the vaccine could not reach Nepal because the court there had ordered to give priority to its citizens first and not to export it.


This decision is not surprising. Because at that time, the demand for the vaccine against Covid was very high, while India did not have enough vaccine for its own citizens. At that time, 'vaccine diplomacy' was also very popular.


There was a competition among developed countries to develop the vaccine against Covid the fastest. At that time, Covid was present as a great enemy against humanity, and the world was working day and night to develop a vaccine to protect itself from it.


After the Covid vaccine was stopped from India, another neighboring country, China, received the Covid vaccine as assistance. This also shows how important vaccines are during major epidemics.


The role played by the Serum Institute of India during the Covid pandemic is also an example of how much relief can be provided in an epidemic when a vaccine manufacturing company is available.


Although Covid itself is a new disease, scientists were able to develop a vaccine faster than expected due to their hard work day and night. Naturally, the possibility of developing a new vaccine is also greater in developed countries due to the presence of high-quality research laboratories and excellent scientists.


During an epidemic of a highly infectious and deadly disease that terrifies the world, only a limited number of vaccines are produced by limited production organizations in limited countries. In this case, the possibility of sending it to other countries or the rest of the world is also reduced.


But if the vaccine formula or the 'components' used in the vaccine can be obtained and a vaccine manufacturing company is available, India can be taken as an example that vaccine production can be continued. If there was a vaccine manufacturing company in Nepal during the Covid pandemic, human losses could have been prevented to a large extent. It was a situation where we had to sit and stare at foreigners.


It has only been a few years since the Covid pandemic ended. Those moments are still very tragic, especially for those who have lost their relatives to Covid or who have managed to survive severe Covid.


But it has also taught us some lessons. 

Use of AI in healthcare: How useful, how dangerous?

 Use of AI in healthcare: How useful, how dangerous?


Doctors use AI in conjunction with their knowledge, experience, and patient conditions, and the risk increases when patients base their decisions on that.



Use of AI in healthcare: How useful, how dangerous?


AI is not a replacement for doctors, but a tool to expand their capabilities, and its responsible use in the healthcare sector is necessary.

Some time ago, during the confusing time when the Medical Education Commission announced the PG results, I created a ‘seat predictor’ tool using available data and AI.


Recently, when the actual results of the government seat came out, this tool of mine seemed to be ‘conservatively’ very safe. The tool had ‘underestimated’ the actual rank somewhat, so that doctors did not have false expectations and could make safe decisions.


I have also included its detailed description and how to use it in the description of the MD/MS video of my Bimarsha Acharya YouTube channel. This small experiment made me realize one big thing, that AI is not an ‘enemy’ for the Nepali healthcare sector, but rather a powerful ‘co-pilot’ for those who know how to use it correctly.


In this context, I have been conducting clinical research training sessions, in which I have also been regularly covering the use of AI, its ethical aspects and its responsible integration into daily medical practice.


In the process, I have trained more than 700 doctors and medical students in Nepal. This experience has further highlighted the need to use AI not just as a tool, but also in a safe and responsible way with proper guidance.


AI has become like a companion to me while seeing patients daily in the hospital. I use it regularly to remember medication doses or precautions, compare different treatment methods, align my decisions with international guidelines and understand the results of the latest research and trials. In complex cases, comparing your initial clinical thinking with evidence-based information makes decisions clearer and more confident. In this way, AI is a powerful tool to augment the capabilities of doctors, not replace them.


AI for doctors: Which is more useful?


The various AI tools in use today, such as Grok, Gemnai, ChatGPT, Perplexity, and OpenEvidence, have their own roles. However, their use varies depending on the context. ChatGPT, Gemnai, or Grok can be useful for understanding general information, clarifying concepts, and facilitating quick clarification. Perplexity presents information with sources, making it easier to search and compare. However, evidence-based, contextual, and up-to-date information is extremely important for clinical decisions.


OpenEvidence is considered particularly useful in this regard. This platform focuses on providing evidence-based information based on international journals, clinical trials, and established guidelines. It shows doctors not just the answer, but also the scientific basis for it, which makes clinical decisions safe, reliable, and accountable.


Therefore, while various AI tools can be used for general understanding, OpenEvidence is considered one of the most suitable options in the current situation as an evidence-based platform for clinical practice and decision-making.


The danger of relying on AI's advice


Nowadays, many patients have started using AI like doctors. There is an increasing trend of seeking medical advice directly after experiencing common symptoms, which can be a serious danger.


For example, if someone has a stomach ache, AI can recommend a medicine to relieve common pain. But a serious problem like appendicitis may be hidden within that symptom. Even if the medicine provides relief for some time, the disease may become more complicated.


This is where the difference between AI used by doctors and patients becomes clear. Doctors use AI by combining their knowledge, experience, and patient's condition, while patients directly base their decisions on it, which increases the risk. Self-medication can sometimes even put lives at risk.


AI in Nepal's health sector


In a country with geographical challenges like Nepal, AI can bring about a major change in healthcare. In remote areas where there is a lack of specialist doctors, AI can help in decision-making at the primary level. Its use in X-rays, cardiac tests or emergency assessment can guide timely treatment.


Combining AI with telemedicine can reduce the distance between villages and cities. Patients can get specialist services nearby, while doctors can also provide better service with limited resources.


AI can also play a big role in the research sector. It can help increase participation in complex studies, data analysis and international publications. This has the potential to make Nepal’s health system knowledge-based and technology-friendly.


Our responsibility now


The future competition will not be between doctors and AI, but between doctors who know how to use AI and those who do not. A system that cannot adapt with time will fall behind.


Therefore, it is necessary for both the government and the private sector to work together to formulate a clear policy to integrate AI into the health system. It is imperative to provide training, resources and incentives to doctors.


If we fail to embrace this technology today, we will be unable to compete globally tomorrow. But if we move in the right direction, Nepali health care The sector can establish its identity on an international level.


The question now is clear: will we lead the change or lag behind it?

Study Conclusion: Exercise Reduces Risk of Death from Alcohol Consumption

Study Conclusion: Exercise Reduces Risk of Death from Alcohol Consumption


A study conducted in the UK has shown that regular physical exercise reduces the risk of cancer and heart disease caused by alcohol consumption.


We all know that alcohol consumption affects health. It is advised to consume alcohol in limited quantities or to stay away from it to stay healthy.


The extent to which alcohol consumption affects a person depends on their physical condition, age, their lifestyle, etc. A study conducted in the UK has also shown that the effects of alcohol are reduced in people who exercise regularly.



The study, which was conducted with the aim of finding out whether physical activity reduces the harm caused by drinking alcohol, has been published in the British Journal of Sports Medicine.


This research was conducted on 36,370 people in the UK and Scotland. The study, which was conducted over a few years, also assessed the incidence of cancer and heart disease deaths caused by alcohol consumption.


The people participating in the research were divided into different groups, ranging from non-drinkers to heavy drinkers. In which those who never drank alcohol, those who drank before but have stopped, those who drink only occasionally, those who drink within a certain time limit, those who drink a lot and those who drink excessively were divided into separate groups.


On the other hand, groups were also determined based on physical activity. The study was divided into 3 groups: not at all active, moderately active and very active.


Heavy drinkers were found to have a 40 percent higher risk of death from cancer and heart disease. People who drink alcohol in limited quantities but are not physically active also had a higher risk.


People who also drink alcohol and keep themselves physically active had a lower risk of death from these diseases.


This study concluded that physical activity reduces the risk of alcohol on health.


According to experts, alcohol consumption creates a lot of pressure on the digestive process. It increases 'oxidative stress' in the body. Alcohol also affects the process of digesting fat in the body and plays a role in increasing cholesterol. High blood pressure also increases the risk of heart disease.


The study concluded that physical activity can reduce these risks to some extent.


Although physical exercise reduces the health risks associated with alcohol consumption, the study recommends limiting alcohol consumption and getting regular physical exercise to stay healthy.

Negative thinking affects the heart

 Negative thinking affects the heart


Stress increases the heart rate, increases blood pressure, and causes long-term heart problems.

According to a 2023 study by the American Heart Association, constant stress increases the risk of heart disease by up to 35 percent.

Doctors recommend 10-15 minutes of meditation, 30 minutes of exercise, and positive thinking daily to reduce stress.

A problem that is not visible on the surface but silently haunts most people on the inside is stress. From the outside, people appear normal, busy with their work. But inside, they are carrying the burden of worry, fear, and pressure. This invisible negative burden gradually weakens the body.



Negative thinking, anxiety, anger, or frustration is not only a problem for the mind, but also a big threat to the heart. Its effect on the heart is even deeper.


When you are constantly stressed, your heart starts beating faster, your blood pressure increases, and your body is forced to be on alert all the time. If this situation persists for a long time, the risk of heart problems increases.


How does stress affect the heart? What is the relationship between the heart and stress?


The relationship is very simple, the mind controls the body. When negative thoughts come, the body goes into 'stress mode'. The heart beats faster, blood pressure increases and the heart has to work harder. If this thought is short-lived, the heart is fine.


But if the pressure of daily work, financial problems, family disputes, comparisons with friends, negative news on social media or past tragic events continue to bother you, negative thoughts take up residence in the mind. This puts a burden on the heart and increases the stress associated with it.


Negative thoughts, especially anger, rage, hatred and malicious thoughts, have a profound effect on the heart because they overactivate the 'sympathetic nervous system'. This system is responsible for the 'fight or flight' response, that is, the body's natural response to immediately fight or flee when it senses danger.


When we get angry or have negative thoughts, the brain immediately sends signals that increase levels of stress hormones such as adrenaline, cortisol, and noradrenaline.


These hormones increase heart rate, raise blood pressure, and constrict blood vessels. Over time, this activation damages the inner lining of the heart’s blood vessels (endothelium). The endothelium loses its ability to dilate blood vessels, which reduces blood flow and accelerates the process of plaque buildup.


If the mind is not healthy, the heart will not be healthy


If the mind is not healthy, the heart will not be healthy either. Studies say so. According to a 2023 study by the American Heart Association, depression, anxiety, and chronic stress increase the risk of heart disease by up to 35 percent. Similarly, a large study by the British Heart Foundation also showed that stress at work or at home increases the risk of heart disease and stroke, albeit slightly.


Studies have shown that even a short bout of anger can reduce the dilation of blood vessels for up to 40 minutes, which reduces the amount of oxygen reaching the heart. Anger and hatred cause blood vessels to constrict through sympathetic activation. This weakens the heart muscle and can lead to heart failure or irregular heartbeat.


What happens in the long term?


If you have persistent negative thoughts, your heart is under constant pressure, which can lead to high blood pressure. Blood vessels gradually narrow, reducing blood flow to the heart and increasing the risk of heart attack or stroke.


High risk with chronic illness if you have negative thoughts


If you already have diabetes, high blood pressure or high cholesterol, the risk doubles if you add negative thoughts. Because these diseases are already weakening the heart, negative thoughts increase stress and put additional 'overload' on the heart. According to the British Heart Foundation, such stress increases the risk of heart disease and worsens the condition of those who already have heart problems.


When should you see a doctor?


If you feel pain or pressure in the chest, difficulty breathing, irregular heartbeat, excessive fatigue, dizziness, cold sweats, or pain in the hands are more common after negative thoughts or stress, this may be a sign of a heart problem.


It is advisable to go to the hospital immediately without delay. In addition, a heart check can be done during regular checkups. ECG, echo, and blood tests also tell about the condition of the heart in a timely manner.


In addition, these problems can be reduced to a large extent by consulting a mental health professional.


How to reduce negative thoughts and stress?


First of all, it is important to establish positive habits in your daily life. Practicing deep breathing or meditation for 10-15 minutes every day calms the mind and reduces stress.


Similarly, walking or doing light exercise for about 30 minutes daily keeps the body active, the mind light, and heart health is also good. If a negative thought comes to your mind, you should not let it develop further. At such times, remind yourself, "This thought may not always be right, I can try."


Such positive thoughts gradually strengthen the mind.


Keeping things bottled up inside can increase stress. Talking to family members or close friends can help you relax and understand your problems and find solutions.


Eating a healthy diet, getting enough sleep, and avoiding social media can also help you feel more at ease. If you still feel stressed and negative thoughts aren't going away after trying these things, talk to a therapist or counselor. 

Popular Posts