Intelligent touchscreens integrated with a locally hosted VLM for a low-latency, high-privacy AIoT experience. Expansion-ready to fit your needs.
Existing HMI solutions haven't kept pace with the latest innovations in multimodal LLMs. Users expect intelligent, conversational interfaces — not static button grids.
Adding LLM or AI features means ongoing cloud API costs or expensive library licenses. Per-query pricing makes scaling unpredictable and eats into margins.
Facial recognition and voice commands processed in the cloud mean your biometric data leaves your premises. For many applications, this is simply not acceptable.
Non-real-time operating systems can't prioritize critical events. When an alarm triggers, Android and Linux panels offer "best effort" — not guaranteed response times.
Finding the right hardware is a challenge. Long lead times, minimum order quantities, and component shortages make it difficult to get what you need, when you need it.
Modern applications require integrating new sensors — radar, multiple cameras, environmental monitors. Most HMI panels lack the I/O flexibility to accommodate them.
Our solution addresses all of these challenges.
Hardware and software designed together, optimized for local AI and real-time performance. Cost-effective design leveraging open-source platforms.
Thin client architecture — powerful enough for AI vision, light enough to stay responsive. Acts as an intelligent interface to your hub, panel, or automation system.
Dual-core RISC-V processor running at up to 400 MHz, designed for HMI applications and edge computing with rich I/O capabilities.
2.4 and 5 GHz dual-band Wi-Fi 6, Bluetooth 5 (LE), and 802.15.4 for Thread, Zigbee, Matter, HomeKit, and MQTT support.
Facial recognition for secure operations, processed entirely on the device. No cloud, no external servers, no privacy compromise.
Hardware video encoding for WebRTC streaming. Display IP camera feeds or use built-in camera for video intercom applications.
Available I/O for adding 60/77 GHz radar for 3D person sensing, dual cameras, co-processors for Z-Wave, proprietary wireless, or RS-485 bus protocols.
Real-time operating system delivers consistent, predictable performance — faster and more reliable than Linux or Android-based alternatives.
An edge AI platform for developing and deploying intelligent agents — completely offline with enterprise-grade capabilities.
40 TOPS AI acceleration with the Ara-2 Runtime SDK. Run 8B VLMs locally with an OpenAI-compatible REST API — no internet required.
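As a rough sketch of what that looks like in practice, the snippet below sends a camera frame and a question to the box using the standard OpenAI Python client. The hostname, port, and model name are placeholders; substitute the values from your own deployment.

```python
# Minimal sketch: query the VLM Box over its OpenAI-compatible REST API.
# Hostname, port, and model id are placeholders for your own deployment.
import base64
from openai import OpenAI

client = OpenAI(
    base_url="http://vlm-box.local:8000/v1",  # local endpoint, no cloud involved
    api_key="not-needed",                     # offline server; the key is unused
)

# Encode a camera frame so it can be sent inline with the prompt.
with open("doorbell_frame.jpg", "rb") as f:
    image_b64 = base64.b64encode(f.read()).decode()

response = client.chat.completions.create(
    model="local-vlm",  # whatever model id the box exposes
    messages=[{
        "role": "user",
        "content": [
            {"type": "text", "text": "Who is at the door and what are they carrying?"},
            {"type": "image_url",
             "image_url": {"url": f"data:image/jpeg;base64,{image_b64}"}},
        ],
    }],
    max_tokens=200,
)
print(response.choices[0].message.content)
```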
Built on the Google ADK orchestrator for multi-agent workflows. Seamlessly integrate AI agents via the Agent-to-Agent (A2A) protocol for complex task automation.
Extend capabilities with Model Context Protocol (MCP) tools. Connect to Home Assistant, time series databases, and custom backends with plug-and-play ease.
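For illustration, here is a minimal custom MCP tool server, built with the official Python SDK, that an orchestrator could call to switch a Home Assistant light. The Home Assistant URL, token, and entity ID are placeholders, and this sketch stands in for whichever integration you actually deploy.

```python
# Sketch of a custom MCP tool server exposing one Home Assistant action.
# The Home Assistant URL, token, and entity id are placeholders.
import os
import requests
from mcp.server.fastmcp import FastMCP

mcp = FastMCP("home-assistant-tools")

HA_URL = os.environ.get("HA_URL", "http://homeassistant.local:8123")
HA_TOKEN = os.environ["HA_TOKEN"]  # long-lived access token

@mcp.tool()
def turn_on_light(entity_id: str) -> str:
    """Turn on a Home Assistant light, e.g. 'light.kitchen'."""
    resp = requests.post(
        f"{HA_URL}/api/services/light/turn_on",
        headers={"Authorization": f"Bearer {HA_TOKEN}"},
        json={"entity_id": entity_id},
        timeout=5,
    )
    resp.raise_for_status()
    return f"{entity_id} turned on"

if __name__ == "__main__":
    mcp.run(transport="stdio")  # the agent framework connects over stdio
```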
Deploy at scale via Docker containers. Gateway server architecture enables easy monitoring and management of ecosystem components.
Bring any supported LLM to work within the orchestrator. Fine-tune with your RAG data for domain-specific responses.
Built-in safety mechanisms ensure appropriate, reliable responses for consumer and enterprise deployments.
Pre-built client agent and meeting apps for Android, iOS, Desktop, Web, and Embedded Linux — get started instantly on any platform.
From compact interfaces to full tablet experiences — same platform, same software stack.
We deliver complete hardware and software solutions tailored to your requirements.
All core functionality runs on your premises. No cloud accounts, no subscriptions, no data leaving your network.
Touchscreen and VLM are one-time costs. Engineering hours available for customization, porting, and expansion.
No heavy processes on the display. Clean messaging interface with your hub, panel, or automation system.
Connect to local AI agents like Qwen3 VL or cloud-based services. MCP support for extensible AI workflows.
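The thin-client pattern boils down to simple message passing. The Python sketch below models the idea with MQTT; the production touchscreen firmware is RTOS-based, so treat this as an illustration of the message flow rather than device code, and note that the broker address and topic names are examples only.

```python
# Illustrative sketch of the thin-client messaging pattern: the display
# publishes user intents and subscribes to state updates from the hub.
import json
import paho.mqtt.client as mqtt

def on_connect(client, userdata, flags, reason_code, properties):
    # Listen for state the hub wants the panel to display.
    client.subscribe("panel/kitchen/state")

def on_message(client, userdata, msg):
    state = json.loads(msg.payload)
    print("update UI with:", state)

client = mqtt.Client(mqtt.CallbackAPIVersion.VERSION2)
client.on_connect = on_connect
client.on_message = on_message
client.connect("hub.local", 1883)

# Publish an intent instead of running any heavy logic on the display.
client.publish("panel/kitchen/intent",
               json.dumps({"action": "lights_off", "room": "kitchen"}))
client.loop_forever()
```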
Intelligent display paired with your hub or panel. Facial recognition for secure disarm, natural language control, and seamless integration with existing automation systems.
Rich connectivity options — Thread, Zigbee, Matter, HomeKit, MQTT — make this the ideal interface for IoT deployments. Display sensor data, control devices, monitor systems.
Real-time operating system delivers consistent performance for machine interfaces. Expansion options for RS-485 bus protocols and industrial sensors. Rugged and reliable.
Expressive face display with local AI vision and VLM connectivity. Perfect for social robots, assistants, and interactive installations requiring natural interaction.
Safe, offline AI with built-in guardrails. No inappropriate content, no data collection, no cloud dependency. Parents can trust what their children interact with.
More information about our team and mission will be available here.
Common questions about our platform and technology.
No, ARM64 apps cannot run directly on the ESP32-P4. The ESP32-P4 uses a RISC-V 32-bit architecture and runs FreeRTOS (a real-time operating system), not Linux. ARM64 binaries are incompatible — it's a completely different instruction set and operating environment.
However, the VLM Box runs on ARM64 Linux (i.MX8M Plus with Cortex-A53), so existing ARM64 Linux applications can potentially run there with minimal modifications. This architecture gives you the best of both worlds: a responsive RTOS-based touchscreen for HMI, and a Linux-based edge computer for complex processing tasks.
The ESP32-P4 can perform basic face detection using Espressif's ESP-WHO framework. However, for secure face recognition with liveness detection (anti-spoofing), the processing requirements exceed what the ESP32-P4 can handle in real time.
For advanced biometric applications, we use a two-step approach: the ESP32-P4 handles initial face detection, then streams the camera frames along with Time-of-Flight (ToF) sensor data to the i.MX8M Plus edge computer. The i.MX8M Plus runs the liveness detection algorithm using ToF depth data to distinguish real faces from photos, videos, or masks — all processed locally without cloud dependency.
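To make the idea concrete, here is a toy sketch of the depth cue that liveness detection relies on: a printed photo or screen is nearly flat, while a real face shows centimetres of relief in the ToF data. The threshold, region handling, and function names are illustrative assumptions, not the shipped algorithm.

```python
# Toy illustration of depth-based liveness: a flat photo has almost no
# depth variation across the face region, a real face does.
import numpy as np

def looks_live(depth_map: np.ndarray, face_box: tuple[int, int, int, int],
               min_relief_mm: float = 15.0) -> bool:
    """depth_map: ToF depth in millimetres; face_box: (x, y, w, h) from the detector."""
    x, y, w, h = face_box
    face_depth = depth_map[y:y + h, x:x + w].astype(float)
    face_depth = face_depth[face_depth > 0]          # drop invalid ToF pixels
    if face_depth.size == 0:
        return False
    # Peak-to-peak relief inside the face region, robust to outliers.
    relief = np.percentile(face_depth, 95) - np.percentile(face_depth, 5)
    return relief >= min_relief_mm

# Example: a synthetic "flat photo" fails the check.
flat = np.full((240, 320), 600.0)                    # every pixel at 600 mm
print(looks_live(flat, (100, 60, 80, 100)))          # False
```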
The ESP32-P4 is purpose-built for Human-Machine Interface (HMI) applications. Its RTOS foundation delivers consistent, predictable response times — critical for touch interfaces. It handles display rendering, touch input, camera preview, audio I/O, and network communication with sub-millisecond latency.
More powerful Linux-based processors introduce unpredictable latency, higher power consumption, longer boot times, and greater complexity. By separating the UI (ESP32-P4) from heavy compute (VLM Box), we optimize each component for its specific role.
No, all core functionality runs offline. The touchscreen and VLM Box communicate over your local network. AI processing, face recognition, voice commands, and VLM inference all happen on-premises — no cloud accounts, no subscriptions, no data leaving your network.
Internet connectivity is optional and only needed if you want features like remote access, OTA updates, or integration with external services.
The touchscreen (ESP32-P4) is the user interface — it handles display, touch, camera, microphone, speaker, and real-time interactions. It runs an RTOS for instant responsiveness and can perform basic on-device AI like face detection.
The VLM Box is an edge AI computer with a 40 TOPS NPU. It handles compute-intensive tasks: running 7B Vision Language Models, advanced face recognition, natural language understanding, and complex reasoning — all locally. Think of it as a local AI server that your touchscreen can query.
Yes. The VLM can be fine-tuned with your own RAG (Retrieval-Augmented Generation) data to provide domain-specific responses. You can customize the knowledge base, personality, and response style for your specific application — whether that's a smart home assistant, industrial support system, or customer service kiosk.
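As a simplified illustration of the RAG flow, the sketch below retrieves the most relevant snippets from a small knowledge base and passes them to the local endpoint as context. Retrieval here is naive keyword overlap purely for clarity; a real deployment would use embeddings, and the endpoint and model name are placeholders.

```python
# Minimal sketch of grounding answers in your own documents before
# calling the VLM Box's local OpenAI-compatible endpoint.
from openai import OpenAI

client = OpenAI(base_url="http://vlm-box.local:8000/v1", api_key="not-needed")

knowledge_base = [
    "The boiler service hatch is behind the utility-room panel.",
    "Filter cartridges should be replaced every six months.",
    "Error E42 on the ventilation unit means the intake is blocked.",
]

def retrieve(question: str, k: int = 2) -> list[str]:
    # Naive keyword-overlap ranking, for illustration only.
    words = set(question.lower().split())
    scored = sorted(knowledge_base,
                    key=lambda doc: len(words & set(doc.lower().split())),
                    reverse=True)
    return scored[:k]

question = "What does error E42 mean?"
context = "\n".join(retrieve(question))

reply = client.chat.completions.create(
    model="local-vlm",
    messages=[
        {"role": "system",
         "content": f"Answer using only this site documentation:\n{context}"},
        {"role": "user", "content": question},
    ],
)
print(reply.choices[0].message.content)
```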
The touchscreen UI is also fully customizable with your branding, color schemes, and interface layouts.
Thank you for your interest in Local VLM Touch! Join our waitlist to receive updates on our development progress and be among the first to access our platform.
We respect your privacy. No spam, ever.
Tell us about your application. We'll respond within 24 hours to schedule a conversation.