Tag: OpenAI

Yes! OpenAI’s Sora could revolutionise marketing content creation

By Omar H. Fares

OpenAI’s new generative Sora tool has sparked lively technology discussions over the past week, generating both enthusiasm and concern among fans and critics.

Sora is a text-to-video model that significantly advances the integration of deep learning, natural language processing and computer vision to transform textual prompts into detailed and coherent life-like video content.

In contrast to previous text-to-video technologies, like Meta’s Make-A-Video, Sora is able to overcome limitations related to the type of visual data it can interpret, video length and resolution.

From what OpenAI has demonstrated, Sora can generate videos of various lengths, from short clips to full-minute narratives, and in high definition, accommodating a wide range of creative needs.

Although no official release date has been announced, Sora will likely be available to the public in the coming months, judging by OpenAI’s typical pattern of public releases. For now, it’s only available to experts and a few artists and filmmakers.

How Sora works

At the heart of Sora’s innovation is a technique that transforms visual data into a format it can easily understand and manipulate, similar to how words are broken down into tokens for AI processing by text-based applications.

This process involves compressing video data into a more manageable form and breaking it down into patches or segments. These segments act like building blocks that Sora can rearrange to create new videos.

Sora uses a combination of deep learning, natural language processing and computer vision to achieve its capabilities.

Deep learning helps it understand and generate complex patterns in data, natural language processing interprets text prompts to create videos, and computer vision allows it to understand and generate visual content accurately.

By employing a diffusion model — a type of model that’s particularly good at generating high-quality images and videos — Sora can take noisy, incomplete data and transform it into clear, coherent video content.

Sora’s approach differs from CGI character creation, which requires extensive manual effort, and from traditional deepfake technologies, which often lack ethical safeguards, by offering a scalable and adaptable method for generating video content based on textual input.

What does this mean for businesses?

One of the most noteworthy aspects of Sora is its flexibility, as it supports various video formats and sizes, enhances framing and composition for a professional finish, and accepts text, images or videos as prompts for animating images or extending videos.

The emergence of Sora presents key opportunities for businesses across different sectors. In the near future, there are two key areas that may have significant applications.

The first area is in marketing and advertising. Just as ChatGPT has become a marketing and content creation tool, we can expect businesses to use Sora for similar reasons.

With the public release of Sora, brands and companies will be able to create highly engaging and visually appealing video content for marketing campaigns, social media and advertisements.

The ability to generate custom videos based on textual prompts will allow for greater creativity and personalisation, possibly helping brands stand out in a crowded market.

The second area Sora could impact is training and education. Companies could use Sora to develop educational and training videos that are tailored to specific topics or scenarios. This could enhance the learning experience for employees and customers, making complex information more accessible and engaging.

Other sectors, such as e-commerce, also hold promising potential for the future application of Sora. Retailers could create dynamic product demonstrations that effectively showcase products in a more engaging and interactive manner.

This would be especially beneficial for companies that want to highlight specific aspects of products that might not be easily conveyed through static images or text, or for advertising products that require a detailed explanation.

Sora could also significantly reduce the uncertainty associated with online shopping by facilitating virtual try-on experiences, allowing customers to visualize how a product, such as clothing or accessories, would look on them without the need for a physical fitting. This, in turn, could result in a better return on investment.

What are the key challenges ahead?

While there are key opportunities ahead, OpenAI, regulators and users need to carefully consider key factors that could pose challenges, including copyright issues, ethical concerns and the consequences of increased digital noise.

With Sora’s ability to generate lifelike video content, there’s a risk of inadvertently creating videos that infringe on existing copyrights. OpenAI has already been sued several times over copyright infringement and intellectual property issues.

OpenAI hasn’t disclosed where the data used to train Sora is from, but it did tell the New York Times it was training the system using videos that were publicly available and licensed from copyright holders.

The technology also raises ethical questions, particularly around the creation of deepfake videos or misleading content.

Establishing guidelines and safeguards to prevent misuse will be essential for maintaining trust in the technology. In a post on its website, OpenAI stated it was working with experts to test the model before releasing it to the public.

As more businesses and individuals gain access to Sora, there’s a potential for an increase in low-quality or irrelevant video content, leading to increased “digital noise” that could overwhelm users. Finding ways to filter and curate content will become increasingly important for businesses looking to maintain their edge.

Last, but certainly not least, is the question of how Sora will impact the job market for content creators. While Sora does have the potential to automate certain aspects of video production, like ChatGPT, it’s unlikely to replace human creativity and insight anytime soon.

Instead, Sora could serve as a tool that enhances the capabilities of content creators, allowing them to produce higher-quality content more efficiently. As with any technological advancement, the key will be for professionals to adapt and find ways to integrate Sora into their workflows, leveraging its strengths to complement their own skills and creativity.

Omar H. Fares is Lecturer in the Ted Rogers School of Retail Management, Toronto Metropolitan University. This article is republished from The Conversation under a Creative Commons license. Read the original article.

February 27, 2024
As ChatGPT turns 1, the AI chatbot’s success says as much about humans as technology, writes Tim Gorichanaz

On ChatGPT’s first anniv, rejoice?

By Tim Gorichanaz

ChatGPT was launched on Nov. 30, 2022, ushering in what many have called artificial intelligence’s breakout year. Within days of its release, ChatGPT went viral. Screenshots of conversations snowballed across social media, and the use of ChatGPT skyrocketed to an extent that seems to have surprised even its maker, OpenAI. By January, ChatGPT was seeing 13 million unique visitors each day, setting a record for the fastest-growing user base of a consumer application.

Throughout this breakout year, ChatGPT has revealed the power of a good interface and the perils of hype, and it has sown the seeds of a new set of human behaviors. As a researcher who studies technology and human information behaviour, I find that ChatGPT’s influence in society comes as much from how people view and use it as the technology itself.

Generative AI systems like ChatGPT are becoming pervasive. Since ChatGPT’s release, some mention of AI has seemed obligatory in presentations, conversations and articles. Today, OpenAI claims 100 million people use ChatGPT every week.

Besides people interacting with ChatGPT at home, employees at all levels up to the C-suite in businesses are using the AI chatbot. In tech, generative AI is being called the biggest platform since the iPhone, which debuted in 2007. All the major players are making AI bets, and venture funding in AI startups is booming.

Along the way, ChatGPT has raised numerous concerns, such as its implications for disinformation, fraud, intellectual property issues and discrimination. In my world of higher education, much of the discussion has surrounded cheating, which has become a focus of my own research this year.

Lessons from ChatGPT’s first year

The success of ChatGPT speaks foremost to the power of a good interface. AI has already been part of countless everyday products for well over a decade, from Spotify and Netflix to Facebook and Google Maps. The first version of GPT, the AI model that powers ChatGPT, dates back to 2018. And even OpenAI’s other products, such as DALL-E, did not make the waves that ChatGPT did immediately upon its release. It was the chat-based interface that set off AI’s breakout year.

There is something uniquely beguiling about chat. Humans are endowed with language, and conversation is a primary way people interact with each other and infer intelligence. A chat-based interface is a natural mode for interaction and a way for people to experience the “intelligence” of an AI system. The phenomenal success of ChatGPT shows again that user interfaces drive widespread adoption of technology, from the Macintosh to web browsers and the iPhone. Design makes the difference.

At the same time, one of the technology’s principal strengths – generating convincing language – makes it well suited for producing false or misleading information. ChatGPT and other generative AI systems make it easier for criminals and propagandists to prey on human vulnerabilities. The potential of the technology to boost fraud and misinformation is one of the key rationales for regulating AI.

Amid the real promises and perils of generative AI, the technology has also provided another case study in the power of hype. This year has brought no shortage of articles on how AI is going to transform every aspect of society and how the proliferation of the technology is inevitable.

ChatGPT is not the first technology to be hyped as “the next big thing,” but it is perhaps unique in simultaneously being hyped as an existential risk. Numerous tech titans and even some AI researchers have warned about the risk of superintelligent AI systems emerging and wiping out humanity, though I believe that these fears are far-fetched.

The media environment favors hype, and the current venture funding climate further fuels AI hype in particular. Playing to people’s hopes and fears is a recipe for anxiety with none of the ingredients for wise decision making.

What the future may hold

The AI floodgates opened in 2023, but the next year may bring a slowdown. AI development is likely to meet technical limitations and encounter infrastructural hurdles such as chip manufacturing and server capacity. Simultaneously, AI regulation is likely to be on the way.

This slowdown should give space for norms in human behavior to form, both in terms of etiquette, as in when and where using ChatGPT is socially acceptable, and effectiveness, like when and where ChatGPT is most useful.

ChatGPT and other generative AI systems will settle into people’s workflows, allowing workers to accomplish some tasks faster and with fewer errors. In the same way that people learned “to google” for information, humans will need to learn new practices for working with generative AI tools.

But the outlook for 2024 isn’t completely rosy. It is shaping up to be a historic year for elections around the world, and AI-generated content will almost certainly be used to influence public opinion and stoke division. Meta may have banned the use of generative AI in political advertising, but this isn’t likely to stop ChatGPT and similar tools from being used to create and spread false or misleading content.

Political misinformation spread across social media in 2016 as well as in 2020, and it is virtually certain that generative AI will be used to continue those efforts in 2024. Even outside social media, conversations with ChatGPT and similar products can be sources of misinformation on their own.

As a result, another lesson that everyone – users of ChatGPT or not – will have to learn in the blockbuster technology’s second year is to be vigilant when it comes to digital media of all kinds.

Tim Gorichanaz is Assistant Teaching Professor of Information Science, Drexel University. This article is republished from The Conversation under a Creative Commons license. Read the original article.

December 12, 2023
The rush to find good use cases for Generative AI is spawning a new class of start-ups, writes Ashoke Agarrwal

Will 2025 be the year of the arrival of Concierge Intelligence?

By Ashoke Agarrwal

Since OpenAI launched ChatGPT in November 2022, many have heralded (and some) feared the arrival of the age of Artificial Intelligence (AI)?

The rush to find good use cases for Generative AI is spawning a new class of start-ups and keeping Angels and VCs busy.

I have always thought the Information Age was a way-stop on the road to the Age of AI. I have also surmised that the Age of AI will amplify the gains and ills of the Information Age. However, being an eternal optimist, I have always focused on the good technology can do.

In February 2021, in the gloom of the Covid lockdown, when the world had barely an inkling of what Generative AI was, I imagined a use case of AI I called Concierge Intelligence and published a blog post about it.

Here are some excerpts from the post:

“I believe one of the critical directions Artificial Intelligence will develop over the next decade is what I call “Concierge Intelligence”.

Concierge Intelligence will go a long way towards fulfilling the initial promise of the digital age.”

“The era of Concierge Intelligence will avoid the concerns raised by the age of marketing to bots like Alexa or Sirir that I wrote about in my post of April 25th 2018, titled “Marketing to Bots: The Coming Paradigm Shift?”

Concierge Intelligence will instead be the emergence of AI with an agency. The kind of agency that I wrote about in my post dated June 14th 2019, titled “Machine Intelligence to Machine Curiosity – The Route to Machine Creativity”, as also in my post dated December 19th 2019, titled “Should AI Have Agency.”

The individual will buy his Concierge Intelligence (CI) – a software application -from the market and load on onto all the devices she uses. I believe CI will be the next big thing in consumer marketing. CI will get to work to learn the consumer’s interests and preferences. The individual will set the scope and depth of this learning. I can imagine the emergence of a new form of Yoga – CI Yoga! CI Yoga trainers will coach the individual on how to refine their CI settings for maximum well-being.

CI will mediate between the world and the individual. It will map your learning patterns and maximize the speed and efficacy of your learning. It will continuously keep a tab on the individual’s inherent talents and emergent capabilities and connect her with opportunities to use these talents and abilities, in the process not just maximizing her earnings but increasing her sense of self-worth. It will perceive the individual’s relationship and leisure needs and help her meet them. One of the minor duties of CI will be as the gatekeeper to brands and services that seek to message and sell to the individual. While the CI will have powerful capabilities, it will be under the total command of the individual. She can change its functionalities whenever she wants and even switch it off if she so desires, much like today’s smartphones.

Over the next decade, CI will become the most widely prevalent form of AI. I like to think of a CI as AI with a soul. A form of augmented intelligence that fuses an individual’s psyche, with all its complexity and humanity intact, with AI’s power, speed and reach.”

My concept of CI has so taken hold of me that I even wrote about it in my first column for MxMIndia in January 2022, titled “The Coming Post-Digital World.”

Post ChatGPT, the concept of an individual-owned and operated AI model is in the air. Sources in the VC world now tell me that a couple of start-ups are proposing systems close to the CI concept. While musings in blog posts do not give me any monetisable rights, I am glad that while some of my prognostications target a future too far out to find vindication in my lifetime, the CI concept will, in some measure, come true in the next couple of years. I would bet on Apple to be the company that will lead the world into the age of CI. Its current stance of developing AI systems that reside entirely on the user’s AI device and use user information without transmitting out of the device is a stepping stone to CI systems. Plus, of course, the fact that it is the most resourced company in the world and among the most trusted brands.

July 6, 2023
The humanness of artificial intelligence intrigues me and I for one have never been afraid of it, writes Prabhakar Mundkur

Chatting with ChatGPT

Photo : Tara Winstead on pexels.com

If you are an artificial intelligence fan like me, you have no doubt watched all the seasons of Westworld. Or you might have been an avid watcher of Stanley Kubricks 2001 Space Odessey and HAL’s famous words from that movie when he says: ‘I am putting myself to the fullest possible use, which is all I think that any conscious entity can ever hope to do.’ That was as human as human can get.

When I first heard about ChatGPT, I was intrigued. I had played around with IBM Watson many years ago and even used it to make some revealing business analyses and business decisions. So what was ChatGPT going to be like? The humanness of artificial intelligence intrigues me and I for one have never been afraid of it.

Humanity’s biggest threat has always been itself. ~Engerraund Serac in Westworld

In fact ChatGPT interacts with you in a very human way. You could have a conversation or you could ask it questions. What I found was that the interactions are certainly very human, I wouldn’t know the difference.

What is ChatGPT?

ChatGPT is an open chatbot system released just three months ago by OpenAI, a for-profit research lab. You can almost ask it any question. And it will go thru swaths of data from the internet to deliver a dialogue with you. ChatGPT, according to its inventors, is still not perfect and it is still ‘learning.’ The biggest challenge for AI researcher to tackle is the Turing Test. You would have heard of that in the movie ‘The Imitation Game’ that portrayed the life of English mathematician and computer scientist Alan Turing. Turing proposed the following test to gauge intelligence. Can a human conversing with a human and with a computer tell which is which?

UBS Analyst Lloyd Walmsley estimated that 100 million users had used ChatGPT in January 2023 in two months after launch what took Instagram two and half years.

What kind of questions can you ask ChatGPT? Mostly everything from asking it questions in quantum physics to asking it to write a poem. Here are some of the questions I asked.

PM: What is Schroedinger’s Cat?

Chat GPT: Schrödinger’s Cat is a thought experiment in quantum mechanics, proposed by Austrian physicist Erwin Schrödinger in 1935. The experiment describes a scenario in which a cat is placed in a sealed box with a radioactive substance and a poison, such that the radioactive decay of the substance has a 50-50 chance of killing the cat. The idea is to demonstrate the concept of superposition in quantum mechanics, where a particle can exist in multiple states at the same time, and the act of observation collapses the wave function and determines its state. In the case of Schrödinger’s cat, the cat is both alive and dead at the same time until the box is opened and the state is observed.

Or I asked it to write a love poem and this is what I got.

Love is a rose, with petals soft and bright,

A flame that dances, illuminating the night.

A treasure to hold, a gift to share,

A feeling that warms, a bond that’s rare.

It’s the twinkle in your eye, the skip of your heart,

The laughter that echoes, when we’re worlds apart.

It’s the comfort you bring, the peace you bring to my soul,

The place I call home, the place I call home.

With you, I am whole, I am free, I am me,

Together we soar, through the highs and the lows.

And though the winds may howl, and the skies may turn gray,

Our love will endure, with you, come what may.

So here’s to the laughter, the tears and the joy,

To a love that is true, a bond we’ll enjoy.

With you, my dear, my heart overflows,

With love and with life, with love that glows.

Interesting isn’t it? Chatbots are based on Natural Language Processing or NLP. Chatbots are being used for internal business communication, enhancing customer service and to boost online commerce.

What is the future of AI? I think so far all of what we thought was science fiction has pretty much turned out to be reality. I wouldn’t be surprised if our future world was a lot like Westworld.

Prabhakar Mundkur is a veteran adperson, a prolific writer and musician. He writes for MxMIndia on and off. His views are personal

February 7, 2023