All posts
7 min read

What If You Could Run Finance by Talking to It?

future of financeconversational AIinterface design
Arfiti

What If You Could Run Finance by Talking to It?

There is a pattern that has repeated itself across every major computing era, and it goes like this: the interface between humans and machines gets compressed. Each compression removes a layer of translation. Each removal makes the technology accessible to more people and reduces the gap between what someone intends to do and what actually happens.

We are about to watch this pattern play out in one of the last holdout domains: finance operations.

A Brief History of Interface Compression

In the 1970s, running a computer meant typing precise commands into a terminal. You needed to know the exact syntax. A misplaced character meant failure. The barrier to entry was memorizing a language that machines understood but humans did not naturally speak.

Then came the graphical user interface. The Macintosh in 1984, Windows in 1985. Suddenly, you could point at things and click them. You did not need to memorize commands. You needed to recognize icons and navigate menus. The number of people who could use a computer expanded by orders of magnitude.

The web compressed the interface again. You no longer needed to install software. You opened a browser and navigated to a URL. Information and applications became accessible from any machine, anywhere. The friction of distribution collapsed.

Mobile compressed it further. The smartphone put a computer in every pocket. Touch replaced mouse and keyboard. Apps replaced websites for frequent tasks. Computing became something you did in the elevator, at the coffee shop, in bed. The number of internet users went from 1 billion to 5 billion in fifteen years.

Each transition followed the same logic: reduce the translation layer between human intent and machine action. Terminal required you to translate your intent into exact syntax. GUI required you to translate intent into clicks and navigation. Web required you to find the right URL and page. Mobile required you to find the right app and tap the right button.

Conversation removes the translation layer almost entirely. You state your intent in the language you already think in, and the system acts on it. The gap between "what I want" and "what happens" approaches zero.

Why Finance Has Resisted Every Previous Compression

Finance software has participated in each interface era, but it has never been transformed by them. Consider the trajectory of an ERP system. In the 1990s, SAP ran on green-screen terminals. In the 2000s, it got a GUI. In the 2010s, it moved to the web. In the 2020s, it got mobile apps.

And yet, the fundamental experience of operating a finance system has barely changed. A controller closing the month in 2026 still navigates through nested menus, fills out multi-field forms, runs reports by selecting parameters from dropdowns, and copies data between screens. The interface changed. The interaction did not.

This is because previous compressions optimized for discovery and navigation -- making it easier to find things and move between them. But finance work is not primarily about finding and navigating. It is about asking questions, making decisions, and recording transactions. These are fundamentally linguistic acts.

"What was our revenue last quarter?" is a question. "Approve this expense report" is a decision. "Record this invoice against the Acme account" is an instruction.

Every day, finance professionals translate these natural-language thoughts into a series of clicks, menu selections, field entries, and button presses. The software does not understand the intent. It only understands the sequence of inputs. The human does all the translation work.

This is why conversational interfaces are not just another incremental improvement for finance. They are the first interface paradigm that actually matches how finance professionals think and work.

Four Reasons Finance Is Uniquely Suited for Conversation

The work is fundamentally about questions and decisions. A typical controller's day consists of asking the system questions ("What is our cash position?"), giving it instructions ("Post this journal entry"), and making decisions ("Approve this payment run"). These map directly to conversational exchanges. Unlike design work, which requires spatial manipulation, or video editing, which requires timeline interaction, finance operations are almost entirely expressible as requests and responses.

The complexity is in the domain knowledge, not the interaction. A senior controller knows exactly what they want. They understand GAAP, they know which accounts to use, they can spot an anomaly in a trial balance in seconds. Their expertise is deep. But they spend a disproportionate amount of time fighting the software to express that expertise -- navigating to the right screen, entering data in the right fields, running the right report with the right parameters. Conversation lets the domain expert operate at the speed of their knowledge instead of at the speed of the interface.

Natural language maps cleanly to financial operations. Double-entry bookkeeping is one of the oldest and most formalized systems in business. It is rules-based, unambiguous, and internally consistent. When a controller says "record a $50,000 payment to Acme Corp against invoice 1047," there is exactly one correct interpretation and one correct set of accounting entries. This determinism makes financial language unusually well-suited for AI interpretation. Compare this to a request like "make the design feel more modern" -- finance instructions carry precision that creative instructions do not.

Conversation creates a natural audit trail. In traditional systems, an auditor sees that user jsmith posted a journal entry at 3:47 PM on March 15. What they cannot see is why. They have to ask jsmith, who may or may not remember. In a conversational system, the full context is preserved: "I'm reclassifying $12,000 from marketing to R&D because the Q1 campaign was actually a product research initiative. Per discussion with VP of Engineering on March 12." The instruction, the rationale, and the action are captured together. Every operation carries its own documentation.

A Day in Two Worlds

Consider Maria, a controller at a 200-person company with three legal entities. Let us follow her through a Monday morning in two parallel universes.

World A: Traditional Finance Software

8:30 AM -- Maria logs into the ERP. She needs to check the cash position across all three entities. She navigates to the banking module, selects Entity 1, runs the cash report, notes the balance. Switches to Entity 2, runs it again. Switches to Entity 3. She opens a spreadsheet and types in the three numbers to get a consolidated view. Elapsed time: 14 minutes.

9:00 AM -- She needs to review and approve five vendor invoices that came in over the weekend. She navigates to accounts payable, opens the invoice queue, clicks into each invoice, checks the coding, verifies the amounts against purchase orders (switching to another module), and clicks approve. One invoice has an incorrect GL account. She edits the line, saves, then approves. Elapsed time: 35 minutes.

9:45 AM -- The CEO asks for a quick revenue breakdown by entity for the board deck due this afternoon. Maria navigates to reporting, selects the revenue report, configures parameters for each entity, exports to Excel, builds a summary table, formats it, and emails it. Elapsed time: 40 minutes.

10:30 AM -- She needs to post an accrual for a contract that spans two periods. She navigates to the journal entry screen, manually enters six lines (two per entity), double-checks the amounts, saves as draft, reviews, and posts. Elapsed time: 22 minutes.

Total time for four tasks: 1 hour 51 minutes. Most of that time was navigation, data entry, and context switching between modules.

World B: Conversational Finance

8:30 AM -- Maria opens her finance system and types: "What is our consolidated cash position this morning?" The system responds in four seconds with a table showing balances across all three entities, the consolidated total, and the change from Friday. Elapsed time: 30 seconds.

8:35 AM -- "Show me the five pending vendor invoices." The system displays them with GL coding, PO matches, and amounts. "Approve invoices 1, 2, 3, and 5. On invoice 4, change the GL account to 6200 and then approve." Done. Elapsed time: 3 minutes.

8:40 AM -- "Give me a revenue breakdown by legal entity for the current quarter, formatted for a board presentation." The system returns a clean table with entity names, revenue figures, percentage of total, and quarter-over-quarter change. Maria copies it directly into the deck. Elapsed time: 2 minutes.

8:45 AM -- "Post an accrual for the Meridian contract: $15,000 per entity, debit 5100, credit 2300, memo 'Q1 Meridian contract accrual, period 2 of 6.'" The system confirms the six journal lines, shows the balanced entry, and posts it. Elapsed time: 1 minute.

Total time for the same four tasks: under 7 minutes.

This is not a marginal improvement. It is a structural reduction in the time between intent and outcome. Maria's expertise -- the knowledge of which accounts to use, which invoices look right, what the CEO needs -- is identical in both worlds. The only difference is how much translation work the interface demands of her.

"But I Need to See the Numbers"

This is the most common objection, and it is entirely valid. Finance is a domain where trust is built through verification. Controllers do not want to operate blindly. They want to see trial balances, review reports, scan for anomalies, and verify that entries posted correctly.

The answer is not that dashboards and reports disappear. They do not. The distinction is between the interface for doing and the interface for reviewing.

The interface for doing is conversation. You instruct, approve, query, and transact through natural language. This is where the time savings come from, because doing is the repetitive, navigation-heavy part of the job.

The interface for reviewing is visual. Dashboards, reports, drill-downs, and visualizations remain exactly where you expect them. When Maria wants to scan the trial balance for anomalies, she looks at a trial balance. When she wants to review the aging report, she opens a report. The visual interface serves its purpose -- pattern recognition, verification, exploration -- better than conversation ever could.

The key insight is that most finance professionals spend 70-80% of their time doing and 20-30% reviewing. Optimizing the doing through conversation while preserving the reviewing through visual interfaces captures the best of both paradigms.

The Trust Architecture

For conversation to work in finance, trust cannot be aspirational. It must be architectural. This means several things in practice.

Every conversational instruction must produce a verifiable result. When you say "post this entry," the system shows you exactly what it posted and where. There is no black box. The AI interprets your intent, proposes an action, and you confirm or correct before execution.

Every action must be auditable. The conversational record itself becomes part of the audit trail. Who asked for what, when, and what happened as a result. This is actually stronger than traditional audit trails, which capture only the action, not the reasoning.

The system must know its boundaries. When a request is ambiguous ("book the Acme thing"), the system should ask for clarification rather than guess. When a request requires approval ("transfer $500,000"), the system should route it through the appropriate workflow. When a request is outside its capability, it should say so clearly.

These are not nice-to-haves. They are the requirements that separate a useful tool from a dangerous one in a financial context.

The Next Compression

We are at the beginning of this transition. The technology to make conversational finance reliable exists today -- large language models that understand financial domain language, structured databases that enforce accounting rules, workflow engines that manage approvals and controls.

What has been missing is a system designed from the ground up for this paradigm. Most attempts at "AI for finance" bolt a chatbot onto an existing ERP. The result is a system that can answer simple questions but cannot actually operate -- because the underlying architecture was designed for forms and menus, not for intent-driven actions.

Arfiti takes the opposite approach: an ERP built natively for conversational operation. The AI is not an add-on. It is the primary interface. The database enforces financial rules. The workflow engine manages risk and approvals. The visual layer provides dashboards and reports for review. But the way you operate -- day in, day out -- is by talking to your finance system and having it respond with the precision and reliability that financial operations demand.

The terminal-to-GUI transition created an industry. The GUI-to-web transition created another. Each compression expanded the market, reduced costs, and made previously complex operations accessible to smaller organizations.

The compression from clicking to conversation will do the same for finance. The question is not whether it will happen, but how quickly finance leaders will recognize that the interface they have been fighting with for decades was never the right one for the work they do.

The right interface was always language. We just needed the technology to catch up.

Subscribe to new posts

Get notified when we publish new insights on AI-native finance.