Qwen-Image-Edit gives Photoshop a run for its money with AI-powered text-to-image edits that work in seconds

Scoopico
Last updated: August 20, 2025 5:56 am
Published: August 20, 2025

Adobe Photoshop is among the most recognizable pieces of software ever created, used by more than 90% of the world’s creative professionals, according to Photutorial.

So it is a notable achievement that a new open source AI model, Qwen-Image-Edit, released yesterday by the Qwen Team of AI researchers at Chinese e-commerce giant Alibaba, can now accomplish a huge number of Photoshop-like editing jobs with text inputs alone.

Built on the 20-billion-parameter Qwen-Image foundation model released earlier this month, Qwen-Image-Edit extends the system’s unique strengths in text rendering to cover a wide spectrum of editing tasks, from subtle appearance changes to broader semantic transformations.

Simply upload a starting image (I tried one of myself from VentureBeat’s last annual Transform conference in San Francisco), then type instructions for what you want to change, and Qwen-Image-Edit will return a new image with those edits applied.

Input image example:

Image credit: Michael O’Donnell Photography

Output image example with prompt: “Make the person wearing a tuxedo.”

The model is available now across several platforms, including Qwen Chat, Hugging Face, ModelScope, GitHub, and through the Alibaba Cloud application programming interface (API), the last of which allows any third-party developer or enterprise to integrate this new model into their own applications and workflows.

I created my examples above on Qwen Chat, the Qwen Team’s rival to OpenAI’s ChatGPT. Aspiring users should note, however, that generations are limited to about 8 free jobs (input/outputs) per 12-hour period before the quota resets. Paying users have access to more jobs.

With support for both English and Chinese inputs, and a dual focus on semantic meaning and visual fidelity, Qwen-Image-Edit aims to lower barriers to professional-grade visual content creation.

And given that the model is available as open source code under an Apache 2.0 license, it’s safe for enterprises to take, download and set up for free on their own hardware or virtual clouds/machines, potentially resulting in huge cost savings compared with proprietary software like Photoshop.

As Junyang Lin, a Qwen Team researcher, wrote on X, “it can remove a strand of hair, very delicate image modification.”

The team’s announcement echoes this sentiment, presenting Qwen-Image-Edit not as an entirely new system, but as a natural extension of Qwen-Image that applies its unique text rendering and dual-encoding approach directly to editing tasks.

Dual encodings allow for edits preserving style and content of original image

Qwen-Image-Edit builds on the foundation established by Qwen-Image, which was released earlier this year as a large-scale model specializing in both image generation and text rendering.

Qwen-Image’s technical report highlighted its ability to handle complex tasks like paragraph-level text rendering, Chinese and English characters, and multi-line layouts with accuracy.

The report also emphasized a dual-encoding mechanism, feeding images simultaneously into Qwen2.5-VL for semantic control and a variational autoencoder (VAE) for reconstructive detail. This approach allows edits that remain faithful to both the intent of the prompt and the look of the original image.

Those same architectural choices underpin Qwen-Image-Edit. By leveraging dual encodings, the model can adjust at two levels: semantic edits that change the meaning or structure of a scene, and appearance edits that introduce or remove elements while leaving the rest untouched.

Semantic editing includes creating new intellectual property, rotating objects 90 or 180 degrees to reveal different views, or transforming an input into another style such as Studio Ghibli-inspired art. These edits typically modify many pixels but preserve the underlying identity of objects.

Here’s an example of semantic editing from Shridhar Athinarayanan, an engineer at AI applications platform Replicate, who used a Replicate-hosted implementation or “inference” of Qwen to reskin a photo of Manhattan to look like a toy Lego set.

Appearance editing focuses on precise, local changes. In these cases, most of the image remains unchanged while specific objects are altered. Demonstrations include adding a signboard that generates a reflection in water, removing stray hair strands from a portrait, and changing the color of a single letter in a text image.

One good example of appearance editing with Qwen-Image-Edit comes from AnswerAI co-founder and CEO Thomas Hill, who posted a side-by-side on X showing his wife in her wedding dress under an archway and another with the same archway covered with graffiti:

Combined with Qwen’s established strength in rendering Chinese and English text, the editing-focused system is positioned as a flexible tool for creators who need more than simple generative imagery.

The dual control over semantic scope and appearance fidelity means the same tool can serve very different needs, from creative IP development to production-level photo retouching.

Adding or removing text in images

Another standout capability is bilingual text editing. Qwen-Image-Edit allows users to add, remove, or modify text in both Chinese and English while preserving font, size, and style.

This expands on Qwen-Image’s reputation for strong text rendering, particularly in challenging scenarios like intricate Chinese characters.

In practice, this allows for accurate editing of posters, signs, T-shirts, or calligraphy artworks where small text details matter, as seen in another example from Replicate below.

One demonstration involved correcting errors in a piece of generated Chinese calligraphy through a step-by-step chained editing process.

Users could highlight incorrect regions, instruct the system to fix them, and then further refine details until the correct characters were rendered. This iterative approach shows how the model can be applied to high-stakes editing tasks where precision is critical.
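To make the chained process concrete, here is a minimal Python sketch of how a script might feed each edited image back in as the input to the next instruction. It is an illustration only, not the Qwen team’s workflow: `chain_edits` and `fake_edit` are hypothetical names, and `fake_edit` merely records instructions as text where a real script would call an image-editing model.

```python
from typing import Callable, Iterable, List, TypeVar

Image = TypeVar("Image")


def chain_edits(
    image: Image,
    instructions: Iterable[str],
    edit_fn: Callable[[Image, str], Image],
) -> List[Image]:
    """Apply instructions one at a time, feeding each result into the next edit.

    Returns every intermediate image so a reviewer can inspect or roll back a step.
    """
    history = [image]
    for instruction in instructions:
        history.append(edit_fn(history[-1], instruction))
    return history


def fake_edit(image: str, instruction: str) -> str:
    """Hypothetical stand-in for a real editing call; it only logs the instruction."""
    return f"{image} -> [{instruction}]"


steps = chain_edits(
    "calligraphy.png",
    ["fix the character in the marked region", "refine the lower stroke"],
    fake_edit,
)
for step in steps:
    print(step)
```

Keeping the full history, rather than only the final image, mirrors the review-and-refine loop described above: each intermediate result can be checked before the next correction is requested.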

Applications and use cases

The Qwen team has highlighted a range of potential applications:

  • Creative design and IP expansion, such as generating mascot-based emoji packs.
  • Advertising and content creation, where logos, signage, and text-heavy visuals can be customized.
  • Virtual avatars and art, with style transfer supporting distinctive character representations.
  • Photography and personal use, including background adjustments, clothing changes, and object removal.
  • Cultural preservation, demonstrated by correcting classical calligraphy works.

By bridging fine-grained editing with broader creative transformations, Qwen-Image-Edit caters to professionals who need control while remaining approachable for casual experimentation.

Benchmarking and performance

According to the Qwen team, evaluations across public benchmarks indicate that Qwen-Image-Edit delivers state-of-the-art performance in image editing.

This follows from the broader Qwen-Image technical evaluations, where the base model achieved leading results in both general image generation and text rendering tasks.

While specific editing benchmark figures weren’t detailed in the release, Qwen-Image itself ranked highly in independent evaluations such as AI Arena, where human raters compared outputs across models from different providers.

API pricing and availability

Through Alibaba Cloud Model Studio, developers can access Qwen-Image-Edit as an API. Pricing is set at $0.045 per image, with a free quota of 100 images valid for 180 days after activation.
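For budgeting, the quoted figures can be turned into a quick estimate. The Python sketch below assumes a flat per-image price and that the free quota is simply subtracted from the number of images edited; the function name is illustrative, and actual billing rules may differ.

```python
from decimal import Decimal

PRICE_PER_IMAGE = Decimal("0.045")  # USD per image, as quoted above
FREE_QUOTA = 100                    # free images, valid 180 days after activation


def estimate_cost(images: int, free_remaining: int = FREE_QUOTA) -> Decimal:
    """Estimate the USD cost of `images` edits after subtracting unused free quota."""
    if images < 0 or free_remaining < 0:
        raise ValueError("counts must be non-negative")
    billable = max(0, images - free_remaining)
    return billable * PRICE_PER_IMAGE


print(estimate_cost(1_000))  # 900 billable images at $0.045 each
print(estimate_cost(80))     # fully covered by the free quota
```

Under these assumptions, 1,000 edits on a fresh account come to 900 billable images, or $40.50.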

The service is initially available in the Singapore region, with a rate limit of 5 requests per second and up to two concurrent tasks per account.
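A client that submits edits in bulk would need to stay inside both limits. The following Python sketch shows one way to do that on the client side using only the standard library; the `Throttle` class and `run_batch` helper are hypothetical, and the `task` argument stands in for whatever function actually sends a request.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor

MAX_RPS = 5          # requests per second, per the limits quoted above
MAX_CONCURRENT = 2   # concurrent tasks per account, per the limits quoted above


class Throttle:
    """Client-side guard: spaces out request starts and caps tasks in flight."""

    def __init__(self, max_rps: int = MAX_RPS, max_concurrent: int = MAX_CONCURRENT):
        self._min_interval = 1.0 / max_rps
        self._slots = threading.BoundedSemaphore(max_concurrent)
        self._lock = threading.Lock()
        self._next_start = 0.0

    def run(self, task, *args):
        with self._slots:                  # at most max_concurrent tasks in flight
            with self._lock:               # reserve the next allowed start time
                start_at = max(time.monotonic(), self._next_start)
                self._next_start = start_at + self._min_interval
            time.sleep(max(0.0, start_at - time.monotonic()))
            return task(*args)


def run_batch(throttle, task, items, workers=4):
    """Run task(item) for every item through the throttle, preserving input order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(lambda item: throttle.run(task, item), items))
```

Spacing request starts evenly (one every 0.2 seconds at 5 requests per second) is more conservative than allowing bursts, which seems a reasonable default when the server-side accounting is not documented here.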

To use the API, developers must obtain a Model Studio API key and can call the model via HTTP or through the DashScope SDK in Python or Java.

Images can be submitted as URLs or in Base64 format, with supported resolutions ranging from 512 to 4,096 pixels and file sizes up to 10 MB. Output images are hosted on Alibaba Cloud Object Storage with links valid for 24 hours, requiring users to download and save results promptly.
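Those input limits are easy to check before an image is ever uploaded. The Python sketch below is a pre-flight check under stated assumptions: it treats the 512 to 4,096 pixel range as applying to both width and height, reads 10 MB as 10 × 1024 × 1024 bytes, and expects the caller to supply the image dimensions. All function names are illustrative and not part of any Alibaba SDK.

```python
import base64
from datetime import datetime, timedelta

MIN_SIDE, MAX_SIDE = 512, 4096       # pixels; assumed to apply to each side
MAX_BYTES = 10 * 1024 * 1024         # 10 MB; assumes binary megabytes
LINK_LIFETIME = timedelta(hours=24)  # output links are valid for 24 hours


def check_image(data: bytes, width: int, height: int) -> list[str]:
    """Return a list of problems; an empty list means the image passes the checks."""
    problems = []
    if len(data) > MAX_BYTES:
        problems.append(f"file is {len(data)} bytes, limit is {MAX_BYTES}")
    for name, side in (("width", width), ("height", height)):
        if not MIN_SIDE <= side <= MAX_SIDE:
            problems.append(f"{name} {side}px is outside {MIN_SIDE}-{MAX_SIDE}px")
    return problems


def to_base64(data: bytes) -> str:
    """Encode raw image bytes as an ASCII Base64 string."""
    return base64.b64encode(data).decode("ascii")


def link_expiry(created_at: datetime) -> datetime:
    """Return the moment a result link created at `created_at` stops working."""
    return created_at + LINK_LIFETIME


print(check_image(b"\x00" * 2048, width=1024, height=768))  # no problems
print(check_image(b"\x00" * 2048, width=256, height=5000))  # two problems
```

Because result links expire, a pipeline built on this would download each output as soon as it is returned instead of storing the link for later.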

What’s next for Qwen?

Qwen positions Image-Edit as a step toward lowering barriers for visual content creation. By making precise, style-consistent editing more accessible, the model could support applications from design studios to casual users refining personal projects.

The system also signals a broader trend in AI development: moving beyond single-purpose generation toward tools that integrate editing, correction, and refinement.

With both semantic flexibility and appearance-level precision, Qwen-Image-Edit reflects this shift, blending the generative strengths of large models with the reliability required for professional editing.
