101 on OpenAI's new text to video AI model, Sora - Marketing Interactive | Canada News Media
Connect with us

Tech

101 on OpenAI's new text to video AI model, Sora – Marketing Interactive

Published

 on


Artificial intelligence (AI) is developing at breakneck speed and this weekend, OpenAI delivered one of the biggest updates to the system that we have seen in a while. Sora is OpenAI’s newest AI model that can create realistic and imaginative scenes simply with text instructions. 

With Sora, industry professionals will now be able to create realistic and complex videos all without leaving their seat.

This is more important than ever before especially because consumers today watch more videos and the demand for short-form content has rapidly increased, with 66% finding the type of content to be the most engaging, according to a report done by Munch, an AI-powered automation platform for social media.

Don’t miss: Why OpenAI’s comms strategy was a tragic fail amidst the Sam Altman ousting saga

According to the report, video content is no longer an option, but a necessity for business and brands aiming for success, with 42% of businesses preferring Instagram and 26% preferring Facebook to post such videos. TikTok does not rank among the top three platform choices for marketers. 

With the importance of short-form video content in marketing efforts, here’s a breakdown of exactly what you need to know about Sora and how it can help industry professionals in the space. 

What is Sora?

Sora is OpenAI’s solution to getting AI to understand and simulate the physical world in motion, with the goal of training models that help people solve problems that require real-world interaction, it said in a statement. 

As such, Sora is a text-to-video model that can generate videos up to a minute long while maintaining visual quality and adherence to the user’s prompt.

Sora is able to generate complex scenes with multiple characters, specific types of motion, and accurate details of the subject and background. The model understands not only what the user has asked for in the prompt, but also how those things exist in the physical world.

The model has a deep understanding of language which then enables it to accurately interpret prompts and generate compelling characters that express vibrant emotions. Sora can also create multiple shots within a single generated video that accurately depict characters and visual style.

“Specifically, we train text-conditional diffusion models jointly on videos and images of variable durations, resolutions and aspect ratios. We leverage a transformer architecture that operates on spacetime patches of video and image latent codes,” said OpenAI.

How does it work exactly?

This part is a bit technical but according to OpenAI, it takes inspiration from large language models which acquire generalist capabilities by training on internet-scale data.

“The success of the LLM paradigm is enabled in part by the use of tokens that elegantly unify diverse modalities of text—code, math and various natural languages. In this work, we consider how generative models of visual data can inherit such benefits,” it said. 

OpenAI explained in its technical report that where LLMs have text tokens, Sora has visual patches. Patches have previously been shown to be an effective representation for models of visual data.

“We find that patches are a highly scalable and effective representation for training generative models on diverse types of videos and images,” it said. 

Sora then is essentially a diffusion model, which generates a video by starting off with one that looks like static noise and gradually transforms it by removing the noise over many steps.

It is, as a result, capable of generating entire videos all at once or extending generated videos to make them longer.

The model also builds on past research in DALL·E and GPT models. It uses the recaptioning technique from DALL·E 3, which involves generating highly descriptive captions for the visual training data. As a result, the model is able to follow the user’s text instructions in the generated video more faithfully.

In addition to being able to generate a video solely from text instructions, the model is able to take an existing still image and generate a video from it, animating the image’s contents with accuracy and attention to small detail.

What are some of its weaknesses?

As with all AI models, there are weaknesses, bias and misinformation that can sometimes arise. Sora is no exception as OpenAI admits. 

Currently, Sora may struggle with accurately simulating the physics of a complex scene and may not understand specific instances of cause and effect. For example, a person might take a bite out of a cookie, but afterward, the cookie may not have a bite mark, said OpenAI. 

The model may also confuse spatial details of a prompt, for example, mixing up left and right, and may struggle with precise descriptions of events that take place over time, such as following a specific camera trajectory.

Ahead of its public launch, OpenAI said that it will be working with domain experts in areas such as misinformation, hateful content, and bias — who will be adversarially testing the model.

“We’re also building tools to help detect misleading content such as a detection classifier that can tell when a video was generated by Sora. We plan to include C2PA metadata in the future if we deploy the model in an OpenAI product,” it said. It added that it also will utalise its existing safety methods that it has already built for products that use DALL-E 3. 

For example, once in an OpenAI product, its text classifier will check and reject text input prompts that are in violation of its usage policies. These include those that request extreme violence, sexual content, hateful imagery, celebrity likeness, or the IP of others.

“We’ve also developed robust image classifiers that are used to review the frames of every video generated to help ensure that it adheres to our usage policies, before it’s shown to the user,” it said. 

OpenAI will also engage policymakers, educators and artists around the world to understand their concerns and to identify positive use cases for this new technology.

Adopting the technology in marketing

With all that said, the main question really still lies in how marketers and industry professionals can adopt the technology in their day-to-day work and according to industry professionals MARKETING-INTERACTIVE spoke to, it has the potential to be “stunning”.

According to Pramodh Rai, co-founder at Cyber Sierra, the capabilities, and low barrier to entry by Sora means that consumers have a “very high chance” of igniting experimentation in creative teams that include marketers and advertisers.

“Content creation is now revolutionised in very exciting ways. Our ability to prototype rapidly and produce high quality videos as well as dynamic visuals significantly reduces time to market and resources required for traditional ad campaigns,” said Rai. He added that routine editing tasks and content tailored to different platforms as well as audiences can be automated, which frees creative teams to focus on strategic and innovative aspects of their campaigns. 

“As the line between reality and AI is blurring thanks to advancements in AI such as with Sora,  personalised advertising through custom content is set to soar. Existing workflows can be streamlined to enable more collaboration between team members as well as tighter feedback loops. It looks like we can do this cheaply too, so it’s going to spark experimentation at new levels across society,” he said. 

Agreeing with him, Milind, an AI Scientist from Mercedes, who was expressing independent views, notes that from what has been shared so far, the capabilities of the model seem “quite amazing”.

“The consistency and quality of the videos over extended period of time is quite a breakthrough. It would be safe to say that for use cases such as hyper-personalised video content creation it would quite useful. I’m also sure that it will continue to improve offering sound generation and fine-grained control in future,” he said.  

Exercising caution around the technology

Saying that, one should not get too excited about the technology too quickly. According to Edwin Yeo, general manager at Strategic Public Relations Group, marketers need to be “wary” of speedily adopting Sora, or they do so with a “big degree of risk”. He said:

If there’s one thing we learned from advancement in technology, it’s that technology tends to outpace regulations and safety concerns.

He added that with Sora and generative AI in general, questions over usage and copyright still remain a big challenge for marketers and content producers.

Apart from copyright and safety concerns, there’s also a question still of quality, added Yeo. “It’s not great with hands, just like AI art, and there are still questions of the computing power needed to output videos in 4K or 8K formats.”

He added that personally, he has been using the likes of Midjourney for concept presentations. Once approved through, he will still go back to photography and DI. “I reckon for the immediate future, Sora will be similarly useful. That’s already a big impact in the marketing workflow, but we’re very far from Sora being able to replace video production,” said Yeo. 

Adding to his point, Rai noted that there are also still a number of potential brand safety concerns that marketers need to be wary of. 

“For one, deepfakes and misinformation constitute a new level of risk not seen before which could impact brand safety,” said Rai. Additionally, brands may face issues with inappropriate content generation that does not align with brand value or that may be offensive or insensitive. Rai said:

Brand authenticity may take a hit if the world starts to rely on AI generated content and less on human oversight.

Aside from these issues, marketers should also be wary of a lack of human input as AI models such as Sora could misinterpret creative briefs and also create data privacy and security challenges, which may lead to copyright infringement cases.

“Marketers need to use Sora for generating content that resonates with individual preferences and behaviours while placing humans in the part of the loop where the combination of creativity, strategy, analytics and inimitable personal touch shines,” said Rai.

Join us this coming 24 – 25 April for #Content360, a two-day extravaganza centered around four core thematic pillars: Explore with AI; Insight-powered strategies; Content as an experience; and Embrace the future. Immerse yourself in learning to curate content with creativity, critical thinking, and confidence with us at Content360!

Related articles:
OpenAI unveils platform for users to make custom GPTs
SG creates new Gen-AI frameworks, asks global input to build trust
What are Singaporeans making of the questioning of TikTok CEO?

Adblock test (Why?)



Source link

Continue Reading

Tech

Building Homes Faster with our Latest 3D Construction Printer

Published

 on

Copenhagen, 22.10.2024 – COBOD International, the global leader in 3D construction printing technology, proudly introduces the BOD3 3D Construction Printer for 3D printing of real concrete. Equipped with an extendable ground-based track system, the BOD3 advances the construction process by eliminating printer downtime between multiple buildings on the same site, setting a new benchmark for productivity and efficiency. The BOD3 is the most advanced solution for high-volume low-rise construction and a very effective alternative to conventional construction methods.

The heart and key feature of the new BOD3 3D printer is the advanced extendable ground-based track system. This system enables limitless extension along the Y-axes (length), expanding the printable area to cover 2 or 3 buildings, and reducing setup time to a single installation for multi-building projects. It’s a game-changer, allowing continuous, uninterrupted printing across large sites, increasing efficiency for high volume and mass production at an unmatchable scale.

 

Render of COBOD BOD3 3D Construction Printer.

The BOD3, COBOD’s third printer model, is the outstanding achievement of years of dedicated research, development, and close collaboration with customers. It is a vital advancement in automated construction technology, directly addressing the urgent global demand for faster, smarter, more efficient and sustainable building solutions. Like every COBOD 3D printer, the BOD3’s modular design offers customization, allowing it to easily adapt to any customer’s size wishes in addition to complying with the various sizes of construction sites anywhere in the world.

The BOD3 follows COBOD’s vision to build smarter through automation. Its operational stand combines the control and monitoring of both the 3D printer and supplementary equipment in one user-friendly system. The Advanced Hose Management System (AHMS) transports 3D printable material from the materials delivery system to the printhead via hoses secured within E-chains, minimizing physical labor and optimizing material flow. With the addition of the dual dosing system for additives, operators can better control the concrete and adapt it to onsite environmental conditions. By introducing additives directly at the printhead, the system reduces drying time between layers, speeding up the overall construction process. Designed for easy operation and precision, the BOD3 can be operated by a small, trained, and certified team, reducing the costs of projects.

Incorporating the innovative Universal X-Carriage, the BOD3 is ready for future COBOD advancements and technologies, like the introduction of additional tools for the printer aimed at insulating, painting, sanding, etc. This ensures long-term versatility and performance that will keep the BOD3 at the forefront of the industry for years to come.

 

Universal X-Carriage with Printhead.

Already deployed to the global market, the BOD3 is currently active in Indonesia, by Modula Tiga Dimensi, Angola, by Power2Build, and Bahrain, by Ab’aad 3D. The customers report faster project execution with near-zero downtime between individual buildings on the same site. The projects showcase the BOD3’s ability to speed up construction and print with real concrete, with 99% locally sourced materials and 1% of innovative D.fab, a co-developed solution by COBOD and Cemex to make concrete 3D printable.

Henrik Lund-Nielsen, Founder and General Manager of COBOD, commented on the BOD3: “The global housing crisis demands a more efficient construction solution that is faster, more efficient, and scalable. The BOD3 is our answer to this challenge. Drawing on years of research and expertise, we’ve designed the BOD3 with innovative features, making it our most cost-effective and efficient model yet for multiple low-rise buildings. Its design supports high-volume, linear production of houses, enabling mass production without compromising quality. The fact that six units have already been sold before its official launch speaks volumes about the BOD3’s market demand and the trust our customers place in our technology.

Michael Holm, Chief Innovation Officer at COBOD, states, “The advanced ground-based track system was developed as a response to our customers’ needs to increase efficiency and productivity. Now the 3D construction printer can be easily extended, and multiple consecutive structures can be printed with minimal repositioning and zero downtime between projects, making 3D construction printing more efficient than ever before.

 

The BOD3 is now available for purchase worldwide; for more information, please visit our website, www.cobod.com, or contact us at info@cobod.com.

 

RELEVANT LINKS

 

ABOUT COBOD

COBOD stands as the global leader in supplying 3D printers for the construction sector, with over 80 printers distributed across North and Latin America, Europe, the Middle East, Africa, and Asia-Pacific. Driven by a mission to revolutionize construction through multifunctional robots based on 3D printing, COBOD envisions automating half of the construction processes to achieve faster, cost-effective, sustainable results with enhanced design versatility.

From residential, commercial, and public buildings, COBOD’s 3D printers have been instrumental in erecting 1- to 3-story structures across all six inhabited continents. The innovative technology also extends to fabricate large-scale data centers, wind turbine towers, tanks, and more.

Embracing an open-source material approach, COBOD collaborates with global partners, including customers, academia, and suppliers. The company, backed by prominent shareholders such as General Electric, CEMEX, Holcim, and PERI, operates from its main office in Copenhagen, Denmark, and regional competence centers in Miami, Florida, and Kuala Lumpur, Malaysia. COBOD’s dynamic team comprises over 100 professionals from 25 diverse nationalities.

 

ABOUT MODULA TIGA DIMENSI

PT Modula Tiga Dimensi is a joint venture between Bakrie & Brothers (BNBR) and COBOD. BNBR focuses on offering and providing solutions for housing backlog problems currently encountered by the country.

Teaming up with COBOD International, the company is now set to adopt the latest 3D printing construction technology and is ready to offer the Indonesian market a new and better solution to housing obstructions.

 

ABOUT POWER2BUILD
Reshaping the construction sector and adapting it to urgent human needs.

Power2Build is a technology company for the construction industry, prepared to establish partnerships with private, public, and non-governmental organizations (NGOs) so that they can make the transition to Build 4.0 through 3DCP.

We offer our clients value-added services and high-quality projects, always with a multidisciplinary approach that brings together the necessary experience to deal with complex issues.

Continue Reading

Tech

Slack researcher discusses the fear, loathing and excitement surrounding AI in the workplace

Published

 on

 

SAN FRANCISCO (AP) — Artificial intelligence‘s recent rise to the forefront of business has left most office workers wondering how often they should use the technology and whether a computer will eventually replace them.

Those were among the highlights of a recent study conducted by the workplace communications platform Slack. After conducting in-depth interviews with 5,000 desktop workers, Slack concluded there are five types of AI personalities in the workplace: “The Maximalist” who regularly uses AI on their jobs; “The Underground” who covertly uses AI; “The Rebel,” who abhors AI; “The Superfan” who is excited about AI but still hasn’t used it; and “The Observer” who is taking a wait-and-see approach.

Only 50% of the respondents fell under the Maximalist or Underground categories, posing a challenge for businesses that want their workers to embrace AI technology. The Associated Press recently discussed the excitement and tension surrounding AI at work with Christina Janzer, Slack’s senior vice president of research and analytics.

Q: What do you make about the wide range of perceptions about AI at work?

A: It shows people are experiencing AI in very different ways, so they have very different emotions about it. Understanding those emotions will help understand what is going to drive usage of AI. If people are feeling guilty or nervous about it, they are not going to use it. So we have to understand where people are, then point them toward learning to value this new technology.

Q: The Maximalist and The Underground both seem to be early adopters of AI at work, but what is different about their attitudes?

A: Maximalists are all in on AI. They are getting value out of it, they are excited about it, and they are actively sharing that they are using it, which is a really big driver for usage among others.

The Underground is the one that is really interesting to me because they are using it, but they are hiding it. There are different reasons for that. They are worried they are going to be seen as incompetent. They are worried that AI is going to be seen as cheating. And so with them, we have an opportunity to provide clear guidelines to help them know that AI usage is celebrated and encouraged. But right now they don’t have guidelines from their companies and they don’t feel particularly encouraged to use it.

Overall, there is more excitement about AI than not, so I think that’s great We just need to figure out how to harness that.

Q: What about the 19% of workers who fell under the Rebel description in Slack’s study?

A: Rebels tend to be women, which is really interesting. Three out of five rebels are women, which I obviously don’t like to see. Also, rebels tend to be older. At a high level, men are adopting the technology at higher rates than women.

Q: Why do you think more women than men are resisting AI?

A: Women are more likely to see AI as a threat, more likely to worry that AI is going to take over their jobs. To me, that points to women not feeling as trusted in the workplace as men do. If you feel trusted by your manager, you are more likely to experiment with AI. Women are reluctant to adopt a technology that might be seen as a replacement for them whereas men may have more confidence that isn’t going to happen because they feel more trusted.

Q: What are some of the things employers should be doing if they want their workers to embrace AI on the job?

A: We are seeing three out of five desk workers don’t even have clear guidelines with AI, because their companies just aren’t telling them anything, so that’s a huge opportunity.

Another opportunity to encourage AI usage in the open. If we can create a culture where it’s celebrated, where people can see the way people are using it, then they can know that it’s accepted and celebrated. Then they can be inspired.

The third thing is we have to create a culture of experimentation where people feel comfortable trying it out, testing it, getting comfortable with it because a lot of people just don’t know where to start. The reality is you can start small, you don’t have to completely change your job. Having AI write an email or summarize content is a great place to start so you can start to understand what this technology can do.

Q: Do you think the fears about people losing their jobs because of AI are warranted?

A: People with AI are going to replace people without AI.

The Canadian Press. All rights reserved.

Source link

Continue Reading

Tech

Biden administration to provide $325 million for new Michigan semiconductor factory

Published

 on

 

WASHINGTON (AP) — The Biden administration said Tuesday that it would provide up to $325 million to Hemlock Semiconductor for a new factory, a move that could help give Democrats a political edge in the swing state of Michigan ahead of election day.

The funding would support 180 manufacturing jobs in Saginaw County, where Republicans and Democrats were neck-in-neck for the past two presidential elections. There would also be construction jobs tied to the factory that would produce hyper-pure polysilicon, a building block for electronics and solar panels, among other technologies.

Commerce Secretary Gina Raimondo said on a call with reporters that the funding came from the CHIPS and Science Act, which President Joe Biden signed into law in 2022. It’s part of a broader industrial strategy that the campaign of Vice President Kamala Harris, the Democratic nominee, supports, while Republican nominee Donald Trump, the former president, sees tariff hikes and income tax cuts as better to support manufacturing.

“What we’ve been able to do with the CHIPS Act is not just build a few new factories, but fundamentally revitalize the semiconductor ecosystem in our country with American workers,” Raimondo said. “All of this is because of the vision of the Biden-Harris administration.”

A senior administration official said the timing of the announcement reflected the negotiating process for reaching terms on the grant, rather than any political considerations. The official insisted on anonymity to discuss the process.

After site work, Hemlock Semiconductor plans to begin construction in 2026 and then start production in 2028, the official said.

Running in 2016, Trump narrowly won Saginaw County and Michigan as a whole. But in 2020 against Biden, both Saginaw County and Michigan flipped to the Democrats.

The Canadian Press. All rights reserved.

Source link

Continue Reading

Trending

Exit mobile version