    DeepSeek OCR Condenses Charts and Code and Reduces Tokens Per Image by 20X

    By The Tech Guy, October 25, 2025
    DeepSeek’s newly announced OCR (Optical Character Recognition) model renders text-heavy data as images and represents it with up to 20x fewer vision tokens than the equivalent text tokens. It retains about 97% decoding precision at compression ratios below 10x and about 60% at 20x. On OmniDocBench, it outperforms GOT-OCR2.0 and MinerU2.0 while using fewer vision tokens per page.

    arXiv – DeepSeek-OCR: Contexts Optical Compression

    We present DeepSeek-OCR as an initial investigation into the feasibility of compressing long contexts via optical 2D mapping. DeepSeek-OCR consists of two components: DeepEncoder and DeepSeek3B-MoE-A570M as the decoder. Specifically, DeepEncoder serves as the core engine, designed to maintain low activations under high-resolution input while achieving high compression ratios to ensure an optimal and manageable number of vision tokens. Experiments show that when the number of text tokens is within 10 times that of vision tokens (i.e., a compression ratio < 10x), the model can achieve decoding (OCR) precision of 97%. Even at a compression ratio of 20x, the OCR accuracy still remains at about 60%. This shows considerable promise for research areas such as historical long-context compression and memory forgetting mechanisms in LLMs. Beyond this, DeepSeek-OCR also demonstrates high practical value. On OmniDocBench, it surpasses GOT-OCR2.0 (256 tokens/page) using only 100 vision tokens, and outperforms MinerU2.0 (6000+ tokens per page on average) while utilizing fewer than 800 vision tokens. In production, DeepSeek-OCR can generate training data for LLMs/VLMs at a scale of 200k+ pages per day (a single A100-40G). Codes and model weights are publicly accessible at this http URL.
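The compression ratio in the abstract is the number of text tokens divided by the number of vision tokens. The sketch below works through that arithmetic. The function names are hypothetical, and the lookup encodes only the two operating points quoted in the abstract (about 97% below 10x, about 60% at 20x); it returns nothing for ratios the abstract does not report, rather than interpolating.

```python
import math
from typing import Optional


def compression_ratio(text_tokens: int, vision_tokens: int) -> float:
    """Text tokens represented per vision token (hypothetical helper)."""
    if text_tokens < 0 or vision_tokens <= 0:
        raise ValueError("text_tokens must be >= 0 and vision_tokens must be > 0")
    return text_tokens / vision_tokens


def reported_precision(ratio: float) -> Optional[float]:
    """Return the OCR precision quoted in the abstract for this ratio.

    Only two points are quoted: about 0.97 for ratios under 10x and
    about 0.60 at 20x. Anything else is unreported, so return None.
    """
    if ratio < 10:
        return 0.97
    if math.isclose(ratio, 20.0):
        return 0.60
    return None


if __name__ == "__main__":
    # Illustrative page sizes (assumed, not taken from the paper).
    for text_tokens, vision_tokens in [(900, 100), (1500, 100), (2000, 100)]:
        ratio = compression_ratio(text_tokens, vision_tokens)
        print(text_tokens, vision_tokens, ratio, reported_precision(ratio))
```

The same division applies to the OmniDocBench comparison: 256 tokens per page against 100 is a 2.56x reduction, and 6000 against 800 is 7.5x, which is a lower bound because the abstract says "6000+" and "fewer than 800".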

    This could ease LLM bottlenecks in long-context tasks, such as working with large codebases, by fitting more information into a shorter context window and avoiding the performance drops that come with very long inputs.
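As a rough illustration of "fitting more into a shorter window", the sketch below estimates how many text tokens a fixed budget of vision tokens could stand in for at a given compression ratio. The function name and the 8,000-token budget are assumptions chosen for the example, and the estimate ignores prompt overhead and the decoding errors that grow with the ratio.

```python
def text_capacity(window_tokens: int, ratio: float) -> int:
    """Approximate text tokens representable by `window_tokens` vision tokens.

    Hypothetical helper: assumes the whole window holds vision tokens and
    that each vision token stands in for `ratio` text tokens.
    """
    if window_tokens < 0 or ratio <= 0:
        raise ValueError("window_tokens must be >= 0 and ratio must be > 0")
    return int(window_tokens * ratio)


if __name__ == "__main__":
    budget = 8_000  # assumed vision-token budget, for illustration only
    for ratio in (1, 10, 20):
        print(f"{ratio}x -> ~{text_capacity(budget, ratio)} text tokens")
```

At 20x the capacity is largest, but the abstract reports only about 60% OCR accuracy there, so the ratio is a trade-off between capacity and fidelity.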

    It could also make model training faster and cheaper, which is crucial for China amid GPU shortages.

    It conveys dense ideas (text, emotions, visuals) compactly. It also enables generating 200k+ pages of training data per day for LLMs/VLMs on a single A100-40G.

    It parses charts (for example, turning financial reports into structured data), chemical formulas (into SMILES format), geometric figures, and natural images, and it retains general visual and language skills such as description and detection.

    Andrej Karpathy says the DeepSeek paper presents a good OCR model, and he is intrigued by the question of pixels versus text tokens. Pixels may be superior inputs for LLMs: rendering text as images could allow richer, more efficient processing.

    He suggests ditching tokenizers. All inputs, even plain text, could be fed to the model as images for holistic processing.

    Elon Musk’s view is that 99%+ of future AI I/O will be photons (light-based), as reality fundamentally runs on them.

    Brian Wang is a Futurist Thought Leader and a popular Science blogger with 1 million readers per month. His blog Nextbigfuture.com is ranked #1 Science News Blog. It covers many disruptive technology and trends including Space, Robotics, Artificial Intelligence, Medicine, Anti-aging Biotechnology, and Nanotechnology.

    Known for identifying cutting edge technologies, he is currently a Co-Founder of a startup and fundraiser for high potential early-stage companies. He is the Head of Research for Allocations for deep technology investments and an Angel Investor at Space Angels.

    A frequent speaker at corporations, he has been a TEDx speaker, a Singularity University speaker and guest at numerous interviews for radio and podcasts.  He is open to public speaking and advising engagements.
