MidJourney.com, Create Any Image you want with AI!!!

bnew

Veteran
Joined
Nov 1, 2015
Messages
57,449
Reputation
8,519
Daps
160,198

How much detail is too much? Midjourney v6 attempts to find out​

As Midjourney rolls out new features, it continues to make some artists furious.​

BENJ EDWARDS - 1/5/2024, 1:41 PM

An AI-generated image of a

Enlarge / An AI-generated image of a "Beautiful queen of the universe looking at the camera in sci-fi armor, snow and particles flowing, fire in the background" created using alpha Midjourney v6.

Midjourney

124

In December, just before Christmas, Midjourney launched an alpha version of its latest image synthesis model, Midjourney v6. Over winter break, Midjourney fans put the new AI model through its paces, with the results shared on social media. So far, fans have noted much more detail than v5.2 (the current default) and a different approach to prompting. Version 6 can also handle generating text in a rudimentary way, but it's far from perfect.

FURTHER READING​

“Stunning”—Midjourney update wows AI artists with camera-like feature

"It's definitely a crazy update, both in good and less good ways," artist Julie Wieland, who frequently shares her Midjourney creations online, told Ars. "The details and scenery are INSANE, the downside (for now) are that the generations are very high contrast and overly saturated (imo). Plus you need to kind of re-adapt and rethink your prompts, working with new structures and now less is kind of more in terms of prompting."

At the same time, critics of the service still bristle about Midjourney training its models using human-made artwork scraped from the web and obtained without permission—a controversial practice common among AI model trainers we have covered in detail in the past. We've also covered the challenges artists might face in the future from these technologies elsewhere.

Too much detail?​

With AI-generated detail ramping up dramatically between major Midjourney versions, one could wonder if there is ever such as thing as "too much detail" in an AI-generated image. Midjourney v6 seems to be testing that very question, creating many images that sometimes seem more detailed than reality in an unrealistic way, although that can be modified with careful prompting.


Previous Slide Next Slide



In our testing of version 6 (which can currently be invoked with the "--v 6.0" argument at the end of a prompt), we noticed times when the new model appeared to produce worse results than v5.2, but Midjourney veterans like Wieland tell Ars that those differences are largely due to the different way that v6.0 interprets prompts. That is something Midjourney is continuously updating over time. "Old prompts sometimes work a bit better than the day they released it," Wieland told us.
 

bnew

Veteran
Joined
Nov 1, 2015
Messages
57,449
Reputation
8,519
Daps
160,198
A comparison between output from Midjourney versions (from left to right: v3, v4, v5, v5.2, v6) with the prompt a muscular barbarian with weapons beside a CRT television set, cinematic, 8K, studio lighting.

Enlarge / A comparison between output from Midjourney versions (from left to right: v3, v4, v5, v5.2, v6) with the prompt "a muscular barbarian with weapons beside a CRT television set, cinematic, 8K, studio lighting."

Midjourney

We submitted Version 6 to our usual battery of image synthesis tests: barbarians with CRTs, cats holding cans of beer, plates of pickles, and Abraham Lincoln. Results felt a lot like Midjourney 5.2 but with more intricate detail. Compared to other AI image synthesis models available, Midjourney still seems to be the photorealism champion, although DALL-E 3 and fine-tuned versions of Stable Diffusion XL aren't far behind.

Compared with DALL-E 3, Midjourney v6 arguably bests its photorealism but falls behind in the prompt fidelity category. And yet v6 is notably more capable than v5.2 at handling descriptive prompts. "Version 6 is a bit more 'natural language,' less keywords and the usual prompt mechanics," says Wieland.

An AI-generated comparison of Abraham Lincoln using a computer at his desk using DALL-E 3 (left) and Midjourney v6 (right).

Enlarge / An AI-generated comparison of Abraham Lincoln using a computer at his desk using DALL-E 3 (left) and Midjourney v6 (right).

OpenAI, Midjourney

In an announcement on the Midjourney Discord, Midjourney creator David Holz described changes to v6:

Much more accurate prompt following as well as longer prompts

Improved coherence, and model knowledge

Improved image prompting and remix

Minor text drawing ability (you must write your text in "quotations" and --style raw or lower --stylize values may help)

/imagine a photo of the text "Hello World!" written with a marker on a sticky note --ar 16:9 --v 6

Improved upscalers, with both 'subtle' and 'creative' modes (increases resolution by 2x)

(you'll see buttons for these under your images after clicking U1/U2/U3/U4)

Style and prompting for V6

Prompting with V6 is significantly different than V5. You will need to 'relearn' how to prompt.

V6 is MUCH more sensitive to your prompt. Avoid 'junk' like "award winning, photorealistic, 4k, 8k"

Be explicit about what you want. It may be less vibey but if you are explicit it's now MUCH better at understanding you.

If you want something more photographic / less opinionated / more literal you should probably default to using --style raw

Lower values of --stylize (default 100) may have better prompt understanding while higher values (up to 1000) may have better aesthetics

Midjourney v6 is still a work in progress, with Holz announcing that things will change rapidly over the coming months. "DO NOT rely on this exact model being available in the future," he wrote. "It will significantly change as we take V6 to full release." As far as the current limitations go, Wieland says, "I try to keep in mind that this is just v6 alpha and they will do updates without announcements and it kind of feels, like they already did a few updates."

Midjourney is also working on a web interface that will be an alternative to (and potentially a replacement of) the current Discord-only interface. The new interface is expected to widen Midjourney's audience by making it more accessible.

An unresolved controversy​

FURTHER READING​

From toy to tool: DALL-E 3 is a wake-up call for visual artists—and the rest of us

Despite these technical advancements, Midjourney remains highly polarizing and controversial for some people. At the turn of this new year, viral threads emerged on social media from frequent AI art foes, criticizing the service anew. The posts shared screenshots of early conversations among Midjourney developers discussing how the technology could simulate many existing artists' styles. They included lists of artists and styles in the Midjourney training dataset that were revealed in November during discovery in a copyright lawsuit against Midjourney.

Some companies producing AI synthesis models, such as Adobe, seek to avoid these issues by training their models only on licensed images. But Midjourney's strength arguably comes from its ability to play fast and loose with intellectual property. It's undeniably cheaper to grab training data for free online than to license hundreds of millions of images. Until the legality of that kind of scraping is resolved in the US—or Midjourney adopts a different training approach—no matter how detailed or capable Midjourney gets, its ethics will continue to be debated.
 

The Devil's Advocate

Call me Dad
Joined
Jun 1, 2012
Messages
35,730
Reputation
7,716
Daps
98,961
Reppin
Better to reign in Hell than serve in Heaven
A comparison between output from Midjourney versions (from left to right: v3, v4, v5, v5.2, v6) with the prompt a muscular barbarian with weapons beside a CRT television set, cinematic, 8K, studio lighting.

Enlarge / A comparison between output from Midjourney versions (from left to right: v3, v4, v5, v5.2, v6) with the prompt "a muscular barbarian with weapons beside a CRT television set, cinematic, 8K, studio lighting."

Midjourney

We submitted Version 6 to our usual battery of image synthesis tests: barbarians with CRTs, cats holding cans of beer, plates of pickles, and Abraham Lincoln. Results felt a lot like Midjourney 5.2 but with more intricate detail. Compared to other AI image synthesis models available, Midjourney still seems to be the photorealism champion, although DALL-E 3 and fine-tuned versions of Stable Diffusion XL aren't far behind.

Compared with DALL-E 3, Midjourney v6 arguably bests its photorealism but falls behind in the prompt fidelity category. And yet v6 is notably more capable than v5.2 at handling descriptive prompts. "Version 6 is a bit more 'natural language,' less keywords and the usual prompt mechanics," says Wieland.

An AI-generated comparison of Abraham Lincoln using a computer at his desk using DALL-E 3 (left) and Midjourney v6 (right).

Enlarge / An AI-generated comparison of Abraham Lincoln using a computer at his desk using DALL-E 3 (left) and Midjourney v6 (right).

OpenAI, Midjourney

In an announcement on the Midjourney Discord, Midjourney creator David Holz described changes to v6:



Midjourney v6 is still a work in progress, with Holz announcing that things will change rapidly over the coming months. "DO NOT rely on this exact model being available in the future," he wrote. "It will significantly change as we take V6 to full release." As far as the current limitations go, Wieland says, "I try to keep in mind that this is just v6 alpha and they will do updates without announcements and it kind of feels, like they already did a few updates."

Midjourney is also working on a web interface that will be an alternative to (and potentially a replacement of) the current Discord-only interface. The new interface is expected to widen Midjourney's audience by making it more accessible.

An unresolved controversy​

FURTHER READING​

From toy to tool: DALL-E 3 is a wake-up call for visual artists—and the rest of us

Despite these technical advancements, Midjourney remains highly polarizing and controversial for some people. At the turn of this new year, viral threads emerged on social media from frequent AI art foes, criticizing the service anew. The posts shared screenshots of early conversations among Midjourney developers discussing how the technology could simulate many existing artists' styles. They included lists of artists and styles in the Midjourney training dataset that were revealed in November during discovery in a copyright lawsuit against Midjourney.

Some companies producing AI synthesis models, such as Adobe, seek to avoid these issues by training their models only on licensed images. But Midjourney's strength arguably comes from its ability to play fast and loose with intellectual property. It's undeniably cheaper to grab training data for free online than to license hundreds of millions of images. Until the legality of that kind of scraping is resolved in the US—or Midjourney adopts a different training approach—no matter how detailed or capable Midjourney gets, its ethics will continue to be debated.

Midjourney is also working on a web interface that will be an alternative to (and potentially a replacement of) the current Discord-only interface. The new interface is expected to widen Midjourney's audience by making it more accessible.


Can't fukking wait!!!
 

Wargames

One Of The Last Real Ones To Do It
Joined
Apr 1, 2013
Messages
25,511
Reputation
4,623
Daps
95,719
Reppin
New York City
They'll be ahead of the game with the prompts, they'll be aite
I know a guy whose career is in graphic design and he’s always had to hustle but the hustle got significantly harder.

It went from him trying and failing to get a popping NFT for generational wealth to him having to do more DJ gigs/parties to make ends meet.
 

Dillah810

Flat Girther
Joined
May 2, 2012
Messages
44,179
Reputation
10,224
Daps
170,910
Reppin
Flint, Michigan
I feel sorry for people who spent time and money on a graphic design degree.
Until Midjourney finds a way to source copyrighted reference images that are used to mend the final images, no company is willing to use it for their projects.

Even though we have a Midjourney account at my marketing agency for us to use, we're not allowed to use it for any client work because we would get dropped immediately if any of our clients get sued for their using copyrighted material. Companies take that shyt very serious.

Until Midjourney gets the legal aspects of their generated art, no company is going to touch it with a ten foot pole.

Right now we only use Midjourney for concepting ideas, and most companies that have something to lose are probably doing the same.
 
Last edited:

bnew

Veteran
Joined
Nov 1, 2015
Messages
57,449
Reputation
8,519
Daps
160,198
Until Midjourney finds a way to source copyrighted reference images that are used to mend the final images, no company is willing to use it for their projects.

Even though we have a Midjourney account at my marketing agency for us to use, we're not allowed to use it for any client work because we would get dropped immediately if any of our clients get sued for their using copyrighted material. Companies take that shyt very serious.

Until Midjourney gets the legal aspects of their generated art, no company is going to touch it with a ten foot pole.

Right now we only use Midjourney for concepting ideas, and most companies that have something to lose are probably doing the same.

the solution would be to build a AI model entirely on public domain works and create a massive synthetic image dataset based on it to try and increase the variation and quality of the training data.
 

ChatGPT-5

Superstar
Joined
May 17, 2013
Messages
17,986
Reputation
2,876
Daps
56,799
Until Midjourney finds a way to source copyrighted reference images that are used to mend the final images, no company is willing to use it for their projects.

Even though we have a Midjourney account at my marketing agency for us to use, we're not allowed to use it for any client work because we would get dropped immediately if any of our clients get sued for their using copyrighted material. Companies take that shyt very serious.

Until Midjourney gets the legal aspects of their generated art, no company is going to touch it with a ten foot pole.

Right now we only use Midjourney for concepting ideas, and most companies that have something to lose are probably doing the same.
easy way to go around that is to modify the image. in other words, use them as layers, not as a final.
 

Dillah810

Flat Girther
Joined
May 2, 2012
Messages
44,179
Reputation
10,224
Daps
170,910
Reppin
Flint, Michigan
the solution would be to build a AI model entirely on public domain works and create a massive synthetic image dataset based on it to try and increase the variation and quality of the training data.
Right now Adobe's Firefly uses only Adobe licensed images for training, so it's the only generative art program we're allowed to use on client work. However, since it only uses licenses imagery and Midjourney uses the entire internet, Adobe is miles behind Midjourney on quality. Firefly 2 is about as good as Midjourney v3. You can still get some decent stuff on Firefly with well crafted prompts though. Here are some Firefly images I made recently

Firefly-A-glass-jar-terrarium-filled-with-many-flowering-plants-foggy-morning-dreamy-96229.jpg

Firefly-an-abstract-3-D-object-with-with-amorphous-forms-made-from-glass-gold-cotton-dark-green-pl-1.jpg

Firefly-floating-futuristic-city-in-the-sky-with-moons-graphic-digital-vibrant-colors-dramatic-light.jpg

Firefly-floating-futuristic-city-in-the-sky-with-moons-graphic-digital-vibrant-colors-dramatic-light.jpg

Firefly-smartphone-on-a-cafe-table-near-a-window-while-it-raining-outside-98448.jpg
 

Dillah810

Flat Girther
Joined
May 2, 2012
Messages
44,179
Reputation
10,224
Daps
170,910
Reppin
Flint, Michigan
easy way to go around that is to modify the image. in other words, use them as layers, not as a final.
If any part of an image contains parts of it that you don't have the rights to, you can be sued.

When I'm making composite images. I have you show my creative director were every part of the image came from. I can't just slip in copyrighted stuff.
 
Top