Generative artificial intelligence

From Wikipedia, the free encyclopedia

A detailed oil painting of figures in a futuristic opera scene
Théâtre d'Opéra Spatial, an image generated by Midjourney

Generative artificial intelligence (also generative AI or GenAI[1]) is artificial intelligence capable of generating text, images, or other media, using generative models.[2][3][4] Generative AI models learn the patterns and structure of their input training data and then generate new data that has similar characteristics.[5][6]

In the early 2020s, advances in transformer-based deep neural networks enabled a number of generative AI systems notable for accepting natural language prompts as input. These include large language model chatbots such as ChatGPT, Copilot, Bard, and LLaMA, and text-to-image artificial intelligence art systems such as Stable Diffusion, Midjourney, and DALL-E.[7][8][9]

Generative AI has uses across a wide range of industries, including art, writing, script writing, software development, product design, healthcare, finance, gaming, marketing, and fashion.[10][11][12] Investment in generative AI surged during the early 2020s, with large companies such as Microsoft, Google, and Baidu as well as numerous smaller firms developing generative AI models.[2][13][14] However, there are also concerns about the potential misuse of generative AI, including cybercrime or creating fake news or deepfakes which can be used to deceive or manipulate people.[15][16]


The academic discipline of artificial intelligence was founded at a research workshop at Dartmouth College in 1956, and has experienced several waves of advancement and optimism in the decades since.[17] Since its founding, researchers in the field have raised philosophical and ethical arguments about the nature of the human mind and the consequences of creating artificial beings with human-like intelligence; these issues have previously been explored by myth, fiction and philosophy since antiquity.[18] These concepts of automated art date at least to the automata of ancient Greek civilization, where inventors such as Daedalus and Hero of Alexandria were described as having designed machines capable of writing text, generating sounds, and playing music.[19][20] The tradition of creative automatons has flourished throughout history, such as Maillardet's automaton, created in the early 1800s.[21]

Artificial Intelligence is an idea that has been captivating society since the mid-20th century. It began with science fiction familiarizing the world with the concept but the idea wasn't fully seen in the scientific manner until Alan Turing, a polymath, was curious about the feasibility of the concept. Turing's groundbreaking 1950 paper, "Computing Machinery and Intelligence," posed fundamental questions about machine reasoning similar to human intelligence, significantly contributing to the conceptual groundwork of AI. The development of AI was not very rapid at first because of the high costs and the fact that computers were not able to store commands. This changed during the 1956 Dartmouth Summer Research Project on AI where there was an inspiring call for AI research which led it to be a landmark event as it set the precedent for two decades of rapid advancements in the field.[22]

Since the founding of AI in the 1950s, artists and researchers have used artificial intelligence to create artistic works. By the early 1970s, Harold Cohen was creating and exhibiting generative AI works created by AARON, the computer program Cohen created to generate paintings.[23]

Markov chains have long been used to model natural languages since their development by Russian mathematician Andrey Markov in the early 20th century. Markov published his first paper on the topic in 1906,[24][25][26] and analyzed the pattern of vowels and consonants in the novel Eugeny Onegin using Markov chains. Once a Markov chain is learned on a text corpus, it can then be used as a probabilistic text generator.[27][28]

The field of machine learning often uses statistical models, including generative models, to model and predict data. Beginning in the late 2000s, the emergence of deep learning drove progress and research in image classification, speech recognition, natural language processing and other tasks. Neural networks in this era were typically trained as discriminative models, due to the difficulty of generative modeling.[29]

In 2014, advancements such as the variational autoencoder and generative adversarial network produced the first practical deep neural networks capable of learning generative, rather than discriminative, models of complex data such as images. These deep generative models were the first able to output not only class labels for images, but to output entire images.

In 2017, the Transformer network enabled advancements in generative models compared to older Long-Short Term Memory models,[30] leading to the first generative pre-trained transformer (GPT), known as GPT-1, in 2018.[31] This was followed in 2019 by GPT-2 which demonstrated the ability to generalize unsupervised to many different tasks as a Foundation model.[32]

In 2021, the release of DALL-E, a transformer-based pixel generative model, followed by Midjourney and Stable Diffusion marked the emergence of practical high-quality artificial intelligence art from natural language prompts.

In March 2023, GPT-4 was released. A team from Microsoft Research argued that "it could reasonably be viewed as an early (yet still incomplete) version of an artificial general intelligence (AGI) system".[33] Other scholars have disputed that GPT-4 reaches this threshold, calling generative AI "still far from reaching the benchmark of ‘general human intelligence’" as of 2023.[34]


A generative AI system is constructed by applying unsupervised or self-supervised machine learning to a data set. The capabilities of a generative AI system depend on the modality or type of the data set used.

Generative AI can be either unimodal or multimodal; unimodal systems take only one type of input, whereas multimodal systems can take more than one type of input.[35] For example, one version of OpenAI's GPT-4 accepts both text and image inputs.[36]


A user conversing with a simulation of Ludwig Wittgenstein

Generative AI systems trained on words or word tokens include GPT-3, LaMDA, LLaMA, BLOOM, GPT-4, and others (see List of large language models). They are capable of natural language processing, machine translation, and natural language generation and can be used as foundation models for other tasks.[37] Data sets include BookCorpus, Wikipedia, and others (see List of text corpora).


In addition to natural language text, large language models can be trained on programming language text, allowing them to generate source code for new computer programs.[38] Examples include OpenAI Codex.


Stable Diffusion, prompt a photograph of an astronaut riding a horse

Producing high-quality visual art is a prominent application of generative AI.[39] Many such artistic works have received public awards and recognition.

Generative AI systems trained on sets of images with text captions include Imagen, DALL-E, Midjourney, Adobe Firefly, Stable Diffusion and others (see Artificial intelligence art, Generative art, and Synthetic media). They are commonly used for text-to-image generation and neural style transfer.[40] Datasets include LAION-5B and others (See List of datasets in computer vision and image processing).


Generative AI can also be trained extensively on audio clips to produce natural-sounding speech synthesis and text-to-speech capabilities, exemplified by ElevenLabs' context-aware synthesis tools or Meta Platform's Voicebox.[41]

AI-generated music from the Riffusion Inference Server, prompted with bossa nova with electric guitar

Generative AI systems such as MusicLM[42] and MusicGen[43] can also be trained on the audio waveforms of recorded music along with text annotations, in order to generate new musical samples based on text descriptions such as a calming violin melody backed by a distorted guitar riff.


Runway Gen2, prompt A golden retriever in a suit sitting at a podium giving a speech to the white house press corps

Generative AI trained on annotated video can generate temporally-coherent video clips. Examples include Gen-1 and Gen-2 by Runway[44] and Make-A-Video by Meta Platforms.[45]


Generative AI systems can be trained on sequences of amino acids or molecular representations such as SMILES representing DNA or proteins. These systems, such as AlphaFold, are used for protein structure prediction and drug discovery.[46] Datasets include various biological datasets.


Generative AI can also be trained on the motions of a robotic system to generate new trajectories for motion planning or navigation. For example, UniPi from Google Research uses prompts like "pick up blue bowl" or "wipe plate with yellow sponge" to control movements of a robot arm.[47] Multimodal "vision-language-action" models such as Google's RT-2 can perform rudimentary reasoning in response to user prompts and visual input, such as picking up a toy dinosaur when given the prompt pick up the extinct animal at a table filled with toy animals and other objects.[48]


The terms generative AI planning or generative planning were used in the 1980s and 1990s to refer to AI planning systems, especially computer-aided process planning, used to generate sequences of actions to reach a specified goal.[49][50]

Generative AI planning systems used symbolic AI methods such as state space search and constraint satisfaction and were a "relatively mature" technology by the early 1990s. They were used to generate crisis action plans for military use,[51] process plans for manufacturing[49] and decision plans such as in prototype autonomous spacecraft.[52]

Software and hardware[edit]

Generative AI models are used to power chatbot products such as ChatGPT, programming tools such as GitHub Copilot,[53] text-to-image products such as Midjourney, and text-to-video products such as Runway Gen-2.[54] Generative AI features have been integrated into a variety of existing commercially available products such as Microsoft Office,[55] Google Photos,[56] and Adobe Photoshop.[57] Many generative AI models are also available as open-source software, including Stable Diffusion and the LLaMA[58] language model.

Smaller generative AI models with up to a few billion parameters can run on smartphones, embedded devices, and personal computers. For example, LLaMA-7B (a version with 7 billion parameters) can run on a Raspberry Pi 4[59] and one version of Stable Diffusion can run on an iPhone 11.[60]

Larger models with tens of billions of parameters can run on laptop or desktop computers. To achieve an acceptable speed, models of this size may require accelerators such as the GPU chips produced by Nvidia and AMD or the Neural Engine included in Apple silicon products. For example, the 65 billion parameter version of LLaMA can be configured to run on a desktop PC.[61]

Language models with hundreds of billions of parameters, such as GPT-4 or PaLM, typically run on datacenter computers equipped with arrays of GPUs (such as Nvidia's H100) or AI accelerator chips (such as Google's TPU). These very large models are typically accessed as cloud services over the Internet.

In 2022, the United States New Export Controls on Advanced Computing and Semiconductors to China imposed restrictions on exports to China of GPU and AI accelerator chips used for generative AI.[62] Chips such as the Nvidia A800[63] and the Biren Technology BR104[64] were developed to meet the requirements of the sanctions.

There is free software on the market capable of recognizing text generated by generative artificial intelligence (such as GPTZero), as well as images, audio or video coming from it.[65]


The impact of AI on numerous industries has been profound, revolutionizing productivity, decision-making processes, and customer experiences. However, amidst this progress, challenges and concerns have emerged.

The development of generative AI has raised concerns from governments, businesses, and individuals, resulting in protests, legal actions, calls to pause AI experiments, and actions by multiple governments. In a July 2023 briefing of the United Nations Security Council, Secretary-General António Guterres stated "Generative AI has enormous potential for good and evil at scale", that AI may "turbocharge global development" and contribute between $10 and $15 trillion to the global economy by 2030, but that its malicious use "could cause horrific levels of death and destruction, widespread trauma, and deep psychological damage on an unimaginable scale".[66]

Job losses[edit]

A picketer at the 2023 Writers Guild of America strike. While not a top priority, one of the WGA's 2023 requests was "regulations around the use of (generative) AI".[67]

From the early days of the development of AI, there have been arguments put forward by ELIZA creator Joseph Weizenbaum and others about whether tasks that can be done by computers actually should be done by them, given the difference between computers and humans, and between quantitative calculations and qualitative, value-based judgements.[68] In April 2023, it was reported that image generation AI has resulted in 70% of the jobs for video game illustrators in China being lost.[69][70] In July 2023, developments in generative AI contributed to the 2023 Hollywood labor disputes. Fran Drescher, president of the Screen Actors Guild, declared that "artificial intelligence poses an existential threat to creative professions" during the 2023 SAG-AFTRA strike.[71] Voice generation AI has been seen as a potential challenge to the voice acting sector.[72][73]

The intersection of AI and employment concerns among underrepresented groups globally remains a critical facet. While AI promises efficiency enhancements and skill acquisition, concerns about job displacement and biased recruiting processes persist among these groups, as outlined in surveys by Fast Company. To leverage AI for a more equitable society, proactive steps encompass mitigating biases, advocating transparency, respecting privacy and consent, and embracing diverse teams and ethical considerations. Strategies involve redirecting policy emphasis on regulation, inclusive design, and education's potential for personalized teaching to maximize benefits while minimizing harms.[74]


In the financial realm, significant investment surges, as highlighted in discussions by Daron Acemoglu, have led to transformative tools like robo-advisors, reshaping traditional financial practices. Acemoglu's warnings about potential adverse societal consequences stemming from AI, particularly in data harvesting, customer manipulation, and labor market disparities, underscore the complexities of AI's impact on society.[75]

Social Identities[edit]

AI's integration with social identities, elucidated by Marcin Frackiewiczin, holds both promises and challenges. AI's ability to transform traditional research methods, unveiling nuanced patterns within Social identity realms, has immense potential. However, biases ingrained in AI systems perpetuate stereotypes and marginalize groups, emphasizing the critical need to address these biases for inclusivity.[76]


Deepfakes (a portmanteau of "deep learning" and "fake"[77]) are AI-generated media that take a person in an existing image or video and replace them with someone else's likeness using artificial neural networks.[78] Deepfakes have garnered widespread attention and concerns for their uses in deepfake celebrity pornographic videos, revenge porn, fake news, hoaxes, and financial fraud.[79][80][81][82] This has elicited responses from both industry and government to detect and limit their use.[83][84]

Audio deepfakes[edit]

Instances of users abusing software to generate controversial statements in the vocal style of celebrities, public officials, and other famous individuals have raised ethical concerns over voice generation AI.[85][86][87][88][89][90] In response, companies such as ElevenLabs have stated that they would work on mitigating potential abuse through safeguards and identity verification.[91]

Concerns and fandom have spawned from AI generated music. The same software used to clone voices have been used on famous musicians' voices to create songs which mimic their voices, gaining both tremendous popularity and criticism.[92][93][94] Similar techniques have also been used to create improved quality or full-length versions of songs that have been leaked or have yet to been released.[95]

Generative AI has also been used to create new digital artist personalities, with some of these receiving enough attention to receive record deals at major labels.[96] The developers of these virtual artists have also faced their fair share of criticism for their personified programs, including backlash for "dehumanizing" an artform, and also creating artists which create unrealistic or immoral appeals to their audiences.[97]


Generative AI's ability to create realistic fake content has been exploited in numerous types of cybercrime, including phishing scams.[98] Deepfake video and audio have been used to create disinformation and fraud. Former Google fraud czar Shuman Ghosemajumder has predicted that while deepfake videos initially created a stir in the media, they would soon become commonplace, and as a result, more dangerous.[99] Additionally, large-language models and other forms of text-generation AI have been at a broad scale to create fake reviews on ecommerce websites to boost ratings.[100] Cybercriminals have created large language models focused on fraud, including WormGPT and FraudGPT.[101]

Recent research done in 2023 has revealed that generative AI has weaknesses that can be manipulated by criminals to extract harmful information bypassing ethical safeguards. The study presents example attacks done on ChatGPT including Jailbreaks and reverse psychology. Additionally, malicious individuals can use ChatGPT for social engineering attacks and phishing attacks, revealing the harmful side of these technologies.[102]

Misuse in journalism[edit]

In January 2023, broke the story that CNET had been using an undisclosed internal AI tool to write at least 77 of its stories; after the news broke, CNET posted corrections to 41 of the stories.[103]

In April 2023, German tabloid Die Aktuelle published a fake AI-generated interview with former racing driver Michael Schumacher, who had not made any public appearances since 2013 after sustaining a brain injury in a skiing accident. The story included two possible disclosures: the cover included the line "deceptively real", and the interview included an acknowledgement at the end that it was AI-generated. The editor-in-chief was fired shortly thereafter amid the controversy.[104]


In the European Union, the proposed Artificial Intelligence Act includes requirements to disclose copyrighted material used to train generative AI systems, and to label any AI-generated output as such.[105][106]

In the United States, a group of companies including OpenAI, Alphabet, and Meta signed a voluntary agreement with the White House in July 2023 to watermark AI-generated content.[107]

In China, the Interim Measures for the Management of Generative AI Services introduced by the Cyberspace Administration of China regulates any public-facing generative AI. It includes requirements to watermark generated images or videos, regulations on training data and label quality, restrictions on personal data collection, and a guideline that generative AI must "adhere to socialist core values".[108][109]

See also[edit]


  1. ^ Newsom, Gavin; Weber, Shirley N. (September 6, 2023). "Executive Order N-12-23" (PDF). Executive Department, State of California. Retrieved September 7, 2023.
  2. ^ a b Griffith, Erin; Metz, Cade (January 27, 2023). "Anthropic Said to Be Closing In on $300 Million in New A.I. Funding". The New York Times. Retrieved March 14, 2023.
  3. ^ Lanxon, Nate; Bass, Dina; Davalos, Jackie (March 10, 2023). "A Cheat Sheet to AI Buzzwords and Their Meanings". Bloomberg News. Retrieved March 14, 2023.
  4. ^ Pinaya, Walter H. L.; Graham, Mark S.; Kerfoot, Eric; Tudosiu, Petru-Daniel; Dafflon, Jessica; Fernandez, Virginia; Sanchez, Pedro; Wolleb, Julia; da Costa, Pedro F.; Patel, Ashay (2023). "Generative AI for Medical Imaging: extending the MONAI Framework". arXiv:2307.15208 [eess.IV].
  5. ^ Pasick, Adam (March 27, 2023). "Artificial Intelligence Glossary: Neural Networks and Other Terms Explained". The New York Times. ISSN 0362-4331. Retrieved April 22, 2023.
  6. ^ Karpathy, Andrej; Abbeel, Pieter; Brockman, Greg; Chen, Peter; Cheung, Vicki; Duan, Yan; Goodfellow, Ian; Kingma, Durk; Ho, Jonathan; Rein Houthooft; Tim Salimans; John Schulman; Ilya Sutskever; Wojciech Zaremba (June 16, 2016). "Generative models". OpenAI.
  7. ^ Metz, Cade (March 14, 2023). "OpenAI Plans to Up the Ante in Tech's A.I. Race". The New York Times. ISSN 0362-4331. Retrieved March 31, 2023.
  8. ^ Thoppilan, Romal; De Freitas, Daniel; Hall, Jamie; Shazeer, Noam; Kulshreshtha, Apoorv (January 20, 2022). "LaMDA: Language Models for Dialog Applications". arXiv:2201.08239 [cs.CL].
  9. ^ Roose, Kevin (October 21, 2022). "A Coming-Out Party for Generative A.I., Silicon Valley's New Craze". The New York Times. Retrieved March 14, 2023.
  10. ^ "Don't fear an AI-induced jobs apocalypse just yet". The Economist. March 6, 2023. Retrieved March 14, 2023.
  11. ^ Harreis, H.; Koullias, T.; Roberts, Roger. "Generative AI: Unlocking the future of fashion".
  12. ^ "How Generative AI Can Augment Human Creativity". Harvard Business Review. June 16, 2023. ISSN 0017-8012. Retrieved June 20, 2023.
  13. ^ "The race of the AI labs heats up". The Economist. January 30, 2023. Retrieved March 14, 2023.
  14. ^ Yang, June; Gokturk, Burak (March 14, 2023). "Google Cloud brings generative AI to developers, businesses, and governments".
  15. ^ Hendrix, Justin (May 16, 2023). "Transcript: Senate Judiciary Subcommittee Hearing on Oversight of AI". Retrieved May 19, 2023.
  16. ^ Simon, Felix M.; Altay, Sacha; Mercier, Hugo (October 18, 2023). "Misinformation reloaded? Fears about the impact of generative AI on misinformation are overblown". Harvard Kennedy School Misinformation Review. doi:10.37016/mr-2020-127. S2CID 264113883.
  17. ^ Crevier, Daniel (1993). AI: The Tumultuous Search for Artificial Intelligence. New York, New York: BasicBooks. p. 109. ISBN 0-465-02997-3.
  18. ^ Newquist, H. P. (1994). The Brain Makers: Genius, Ego, And Greed In The Quest For Machines That Think. New York: Macmillan/SAMS. pp. 45–53. ISBN 978-0-672-30412-5.
  19. ^ Sharkey, Noel (July 4, 2007), A programmable robot from 60 AD, vol. 2611, New Scientist, archived from the original on January 13, 2018, retrieved October 22, 2019
  20. ^ Brett, Gerard (July 1954), "The Automata in the Byzantine "Throne of Solomon"", Speculum, 29 (3): 477–487, doi:10.2307/2846790, ISSN 0038-7134, JSTOR 2846790, S2CID 163031682.
  21. ^ kelinich (March 8, 2014). "Maillardet's Automaton". The Franklin Institute. Retrieved August 24, 2023.
  22. ^ SITNFlash (August 28, 2017). "The History of Artificial Intelligence". Science in the News. Retrieved December 8, 2023.
  23. ^ Bergen, Nathan; Huang, Angela (2023). "A Brief History of Generative AI" (PDF). Dichotomies: Generative AI: Navigating Towards a Better Future (2): 4.
  24. ^ Gagniuc, Paul A. (2017). Markov Chains: From Theory to Implementation and Experimentation. USA, New Jersey: John Wiley & Sons. pp. 2–8. ISBN 978-1-119-38755-8.
  25. ^ Grinstead, Charles Miller; Snell, James Laurie (1997). Introduction to Probability. American Mathematical Society. pp. 464–466. ISBN 978-0-8218-0749-1.
  26. ^ Bremaud, Pierre (March 9, 2013). Markov Chains: Gibbs Fields, Monte Carlo Simulation, and Queues. Springer Science & Business Media. p. ix. ISBN 978-1-4757-3124-8. Archived from the original on March 23, 2017.
  27. ^ Hayes, Brian (2013). "First Links in the Markov Chain". American Scientist. 101 (2): 92. doi:10.1511/2013.101.92. ISSN 0003-0996.
  28. ^ Fine, Shai; Singer, Yoram; Tishby, Naftali (July 1, 1998). "The Hierarchical Hidden Markov Model: Analysis and Applications". Machine Learning. 32 (1): 41–62. doi:10.1023/A:1007469218079. ISSN 1573-0565. S2CID 3465810.
  29. ^ Jebara, Tony (2012). Machine learning: discriminative and generative. Vol. 755. Springer Science & Business Media.
  30. ^ Cao, Yihan; Li, Siyu; Liu, Yixin; Yan, Zhiling; Dai, Yutong; Yu, Philip S.; Sun, Lichao (March 7, 2023). "A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT". arXiv:2303.04226. {{cite journal}}: Cite journal requires |journal= (help)
  31. ^ "finetune-transformer-lm". GitHub. Retrieved May 19, 2023.
  32. ^ Radford, Alec; Wu, Jeffrey; Child, Rewon; Luan, David; Amodei, Dario; Sutskever, Ilya; others (2019). "Language models are unsupervised multitask learners". OpenAI Blog. 1 (8): 9.{{cite journal}}: CS1 maint: multiple names: authors list (link)
  33. ^ Bubeck, Sébastien; Chandrasekaran, Varun; Eldan, Ronen; Gehrke, Johannes; Horvitz, Eric; Kamar, Ece; Lee, Peter; Lee, Yin Tat; Li, Yuanzhi; Lundberg, Scott; Nori, Harsha; Palangi, Hamid; Ribeiro, Marco Tulio; Zhang, Yi (March 22, 2023). "Sparks of Artificial General Intelligence: Early experiments with GPT-4". arXiv:2303.12712 [cs.CL].
  34. ^ Schlagwein, Daniel; Willcocks, Leslie (September 13, 2023). "ChatGPT et al: The Ethics of Using (Generative) Artificial Intelligence in Research and Science". Journal of Information Technology. 38 (2): 232–238. doi:10.1177/02683962231200411. S2CID 261753752.
  35. ^ "A History of Generative AI: From GAN to GPT-4". March 21, 2023.
  36. ^ "Explainer: What is Generative AI, the technology behind OpenAI's ChatGPT?". Reuters. March 17, 2023. Retrieved March 17, 2023.
  37. ^ Bommasani, R.; Hudson, D. A.; Adeli, E.; Altman, R.; Arora, S.; von Arx, S.; Bernstein, M. S.; Bohg, J.; Bosselut, A; Brunskill, E.; Brynjolfsson, E. (August 16, 2021). "On the opportunities and risks of foundation models". arXiv:2108.07258 [cs.LG].{{cite arXiv}}: CS1 maint: date and year (link)
  38. ^ Chen, Ming; Tworek, Jakub; Jun, Hongyu; Yuan, Qinyuan; Pinto, Hanyu Philippe De Oliveira; Kaplan, Jerry; Edwards, Haley; Burda, Yannick; Joseph, Nicholas; Brockman, Greg; Ray, Alvin (July 6, 2021). "Evaluating Large Language Models Trained on Code". arXiv:2107.03374 [cs.LG].
  39. ^ Epstein, Ziv; Hertzmann, Aaron; Akten, Memo; Farid, Hany; Fjeld, Jessica; Frank, Morgan R.; Groh, Matthew; Herman, Laura; Leach, Neil; Mahari, Robert; Pentland, Alex “Sandy”; Russakovsky, Olga; Schroeder, Hope; Smith, Amy (2023). "Art and the science of generative AI". Science. 380 (6650): 1110–1111. arXiv:2306.04141. Bibcode:2023Sci...380.1110E. doi:10.1126/science.adh4451. PMID 37319193. S2CID 259095707.
  40. ^ Ramesh, Aditya; Pavlov, Mikhail; Goh, Gabriel; Gray, Scott; Voss, Chelsea; Radford, Alec; Chen, Mark; Sutskever, Ilya (2021). "Zero-shot text-to-image generation". International Conference on Machine Learning. PMLR. pp. 8821–8831.
  41. ^ Desai, Saahil (July 17, 2023). "A Voicebot Just Left Me Speechless". The Atlantic. Retrieved November 28, 2023.
  42. ^ Agostinelli, Andrea; Denk, Timo I.; Borsos, Zalán; Engel, Jesse; Verzetti, Mauro; Caillon, Antoine; Huang, Qingqing; Jansen, Aren; Roberts, Adam; Tagliasacchi, Marco; Sharifi, Matt; Zeghidour, Neil; Frank, Christian (January 26, 2023). "MusicLM: Generating Music From Text". arXiv:2301.11325 [cs.SD].
  43. ^ Dalugdug, Mandy (August 3, 2023). "Meta in June said that it used 20,000 hours of licensed music to train MusicGen, which included 10,000 "high-quality" licensed music tracks. At the time, Meta's researchers outlined in a paper the ethical challenges that they encountered around the development of generative AI models like MusicGen".
  44. ^ Metz, Cade (April 4, 2023). "Instant Videos Could Represent the Next Leap in A.I. Technology". The New York Times.
  45. ^ Wong, Queenie (September 29, 2022). "Facebook Parent Meta's AI Tool Can Create Artsy Videos From Text". Retrieved April 4, 2023.
  46. ^ Heaven, Will Douglas (February 15, 2023). "AI is dreaming up drugs that no one has ever seen. Now we've got to see if they work". MIT Technology Review. Massachusetts Institute of Technology. Retrieved March 15, 2023.
  47. ^ Yang, Sherry; Du, Yilun (April 12, 2023). "UniPi: Learning universal policies via text-guided video generation". Google Research, Brain Team. Google AI Blog.
  48. ^ Brohan, Anthony (2023). "RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control". arXiv:2307.15818 [cs.RO].
  49. ^ a b Alting, Leo; Zhang, Hongchao (1989). "Computer aided process planning: the state-of-the-art survey". The International Journal of Production Research. 27 (4): 553–585. doi:10.1080/00207548908942569.
  50. ^ Chien, Steve (1998). "Automated planning and scheduling for goal-based autonomous spacecraft". IEEE Intelligent Systems and Their Applications. 13 (5): 50–55. doi:10.1109/5254.722362.
  51. ^ Burstein, Mark H., ed. (1994). ARPA/Rome Laboratory Knowledge-based Planning and Scheduling Initiative Workshop Proceedings. The Advanced Research Projects Agency, Department of Defense, and Rome Laboratory, US Air Force, Griffiss AFB. p. 219. ISBN 155860345X.
  52. ^ Pell, Barney; Bernard, Douglas E.; Chien, Steve A.; Gat, Erann; Muscettola, Nicola; Nayak, P. Pandurang; Wagner, Michael D.; Williams, Brian C. (1998). Bekey, George A. (ed.). An Autonomous Spacecraft Agent Prototype. Autonomous Robots Volume 5, No. 1. pp. 29–45. Our deliberator is a traditional generative AI planner based on the HSTS planning framework (Muscettola, 1994), and our control component is a traditional spacecraft attitude control system (Hackney et al. 1993). We also add an architectural component explicitly dedicated to world modeling (the mode identifier), and distinguish between control and monitoring.
  53. ^ Sabin, Sam (June 30, 2023). "GitHub has a vision to make code more secure by design". Axios Codebook. Retrieved August 15, 2023.
  54. ^ Vincent, James (March 20, 2023). "Text-to-video AI inches closer as startup Runway announces new model". The Verge. Retrieved August 15, 2023. Text-to-video is the next frontier for generative AI, though current output is rudimentary. Runway says it'll be making its new generative video model, Gen-2, available to users in 'the coming weeks.'
  55. ^ Vanian, Jonathan (March 16, 2023). "Microsoft adds OpenAI technology to Word and Excel". CNBC. Retrieved August 15, 2023. Microsoft is bringing generative artificial intelligence technologies such as the popular ChatGPT chatting app to its Microsoft 365 suite of business software....the new A.I. features, dubbed Copilot, will be available in some of the company's most popular business apps, including Word, PowerPoint and Excel.
  56. ^ Wilson, Mark (August 15, 2023). "The app's Memories feature just got a big upgrade". TechRadar. The Google Photos app is getting a redesigned, AI-powered Memories'll be able to use generative AI to come up with some suggested names like "a desert adventure".
  57. ^ Sullivan, Laurie (May 23, 2023). "Adobe Adds Generative AI To Photoshop". MediaPost. Retrieved August 15, 2023. Generative artificial intelligence (AI) will become one of the most important features for creative designers and marketers. Adobe on Tuesday unveiled a Generative Fill feature in Photoshop to bring Firefly's AI capabilities into design.
  58. ^ Michael Nuñez (July 19, 2023). "LLaMA 2: How to access and use Meta's versatile open-source chatbot right now". VentureBeat. Retrieved August 15, 2023. If you want to run LLaMA 2 on your own machine or modify the code, you can download it directly from Hugging Face, a leading platform for sharing AI models.
  59. ^ Pounder, Les (March 25, 2023). "How To Create Your Own AI Chatbot Server With Raspberry Pi 4". Retrieved August 15, 2023. Using a Pi 4 with 8GB of RAM, you can create a ChatGPT-like server based on LLaMA.
  60. ^ Kemper, Jonathan (November 10, 2022). ""Draw Things" App brings Stable Diffusion to the iPhone". The Decoder. Retrieved August 15, 2023. Draw Things is an app that brings Stable Diffusion to the iPhone. The AI images are generated locally, so you don't need an Internet connection.
  61. ^ Witt, Allan (July 7, 2023). "Best Computer to Run LLaMA AI Model at Home (GPU, CPU, RAM, SSD)". To run LLaMA model at home, you will need a computer build with a powerful GPU that can handle the large amount of data and computation required for inferencing.
  62. ^ Nellis, Stephen; Lee, Jane (September 1, 2022). "U.S. officials order Nvidia to halt sales of top AI chips to China". Reuters. Retrieved August 15, 2023.
  63. ^ Shilov, Anton (May 7, 2023). "Nvidia's Chinese A800 GPU's Performance Revealed". Tom's Hardware. Retrieved August 15, 2023. the A800 operates at 70% of the speed of A100 GPUs while complying with strict U.S. export standards that limit how much processing power Nvidia can sell.
  64. ^ Patel, Dylan (October 24, 2022). "How China's Biren Is Attempting To Evade US Sanctions". Retrieved August 15, 2023.
  65. ^ "5 free software to recognise fake AI-generated images" (in Italian). October 28, 2023.
  66. ^ "Secretary-General's remarks to the Security Council on Artificial Intelligence". July 18, 2023. Retrieved July 27, 2023.
  67. ^ "The Writers Strike Is Taking a Stand on AI". Time. May 4, 2023. Retrieved June 11, 2023.
  68. ^ Tarnoff, Ben (August 4, 2023). "Lessons from Eliza". The Guardian Weekly. pp. 34–39.
  69. ^ Zhou, Viola (April 11, 2023). "AI is already taking video game illustrators' jobs in China". Rest of World. Retrieved August 17, 2023.
  70. ^ Carter, Justin (April 11, 2023). "China's game art industry reportedly decimated by growing AI use". Game Developer. Retrieved August 17, 2023.
  71. ^ Collier, Kevin (July 14, 2023). "Actors vs. AI: Strike brings focus to emerging use of advanced tech". NBC News. SAG-AFTRA has joined the Writer's [sic] Guild of America in demanding a contract that explicitly demands AI regulations to protect writers and the works they create. ... The future of generative artificial intelligence in Hollywood—and how it can be used to replace labor—has become a crucial sticking point for actors going on strike. In a news conference Thursday, Fran Drescher, president of the Screen Actors Guild-American Federation of Television and Radio Artists (more commonly known as SAG-AFTRA), declared that 'artificial intelligence poses an existential threat to creative professions, and all actors and performers deserve contract language that protects them from having their identity and talent exploited without consent and pay.'
  72. ^ Wiggers, Kyle (August 22, 2023). "ElevenLabs' voice-generating tools launch out of beta". TechCrunch. Retrieved September 25, 2023.
  73. ^ Shrivastava, Rashi. "'Keep Your Paws Off My Voice': Voice Actors Worry Generative AI Will Steal Their Livelihoods". Forbes. Retrieved November 28, 2023.
  74. ^ Gupta, Shalene (October 31, 2023). "Underrepresented groups in countries around the world are worried about AI being a threat to jobs". Fast Company. Retrieved December 8, 2023.
  75. ^ "Dangers of unregulated artificial intelligence". CEPR. November 23, 2021. Retrieved December 8, 2023.
  76. ^ Frąckiewicz, Marcin (August 29, 2023). "Artificial Intelligence and the Study of Social Identity". TS2 SPACE. Retrieved December 8, 2023.
  77. ^ Brandon, John (February 16, 2018). "Terrifying high-tech porn: Creepy 'deepfake' videos are on the rise". Fox News. Archived from the original on June 15, 2018. Retrieved February 20, 2018.
  78. ^ Cole, Samantha (January 24, 2018). "We Are Truly Fucked: Everyone Is Making AI-Generated Fake Porn Now". Vice. Archived from the original on September 7, 2019. Retrieved May 4, 2019.
  79. ^ "What Are Deepfakes & Why the Future of Porn is Terrifying". Highsnobiety. February 20, 2018. Archived from the original on July 14, 2021. Retrieved February 20, 2018.
  80. ^ "Experts fear face swapping tech could start an international showdown". The Outline. Archived from the original on January 16, 2020. Retrieved February 28, 2018.
  81. ^ Roose, Kevin (March 4, 2018). "Here Come the Fake Videos, Too". The New York Times. ISSN 0362-4331. Archived from the original on June 18, 2019. Retrieved March 24, 2018.
  82. ^ Schreyer, Marco; Sattarov, Timur; Reimer, Bernd; Borth, Damian (2019). "Adversarial Learning of Deepfakes in Accounting". arXiv:1910.03810 [cs.LG].
  83. ^ "Join the Deepfake Detection Challenge (DFDC)". Archived from the original on January 12, 2020. Retrieved November 8, 2019.
  84. ^ Clarke, Yvette D. (June 28, 2019). "H.R.3230 – 116th Congress (2019-2020): Defending Each and Every Person from False Appearances by Keeping Exploitation Subject to Accountability Act of 2019". Archived from the original on December 17, 2019. Retrieved October 16, 2019.
  85. ^ "People Are Still Terrible: AI Voice-Cloning Tool Misused for Deepfake Celeb Clips". PCMag Middle East. January 31, 2023. Retrieved July 25, 2023.
  86. ^ "The generative A.I. software race has begun". Fortune. Retrieved February 3, 2023.
  87. ^ Milmo, Dan; Hern, Alex (May 20, 2023). "Elections in UK and US at risk from AI-driven disinformation, say experts". The Guardian. ISSN 0261-3077. Retrieved July 25, 2023.
  88. ^ "Seeing is believing? Global scramble to tackle deepfakes". Retrieved February 3, 2023.
  89. ^ Vincent, James (January 31, 2023). "4chan users embrace AI voice clone tool to generate celebrity hatespeech". The Verge. Retrieved February 3, 2023.
  90. ^ Thompson, Stuart A. (March 12, 2023). "Making Deepfakes Gets Cheaper and Easier Thanks to A.I." The New York Times. ISSN 0362-4331. Retrieved July 25, 2023.
  91. ^ "A new AI voice tool is already being abused to make deepfake celebrity audio clips". Engadget. January 31, 2023. Retrieved February 3, 2023.
  92. ^ Gee, Andre (April 20, 2023). "Just Because AI-Generated Rap Songs Go Viral Doesn't Mean They're Good". Rolling Stone. Retrieved December 6, 2023.
  93. ^ Coscarelli, Joe (April 19, 2023). "An A.I. Hit of Fake 'Drake' and 'The Weeknd' Rattles the Music World". The New York Times. Retrieved December 5, 2023.
  94. ^ Lippiello, Emily; Smith, Nathan; Pereira, Ivan (November 3, 2023). "AI songs that mimic popular artists raising alarms in the music industry". ABC News. Retrieved December 6, 2023.
  95. ^ Skelton, Eric. "Fans Are Using Artificial Intelligence to Turn Rap Snippets Into Full Songs". Complex. Retrieved December 6, 2023.
  96. ^ Marr, Bernard. "Virtual Influencer Noonoouri Lands Record Deal: Is She The Future Of Music?". Forbes. Retrieved December 6, 2023.
  97. ^ Thaler, Shannon (September 8, 2023). "Warner Music signs first-ever record deal with AI pop star". New York Post. Retrieved December 6, 2023.
  98. ^ Sjouwerman, Stu (December 26, 2022). "Deepfakes: Get ready for phishing 2.0". Fast Company. Retrieved July 31, 2023.
  99. ^ Sonnemaker, Tyler. "As social media platforms brace for the incoming wave of deepfakes, Google's former 'fraud czar' predicts the biggest danger is that deepfakes will eventually become boring". Business Insider. Retrieved July 31, 2023.
  100. ^ Collinson, Patrick (July 15, 2023). "Fake reviews: can we trust what we read online as use of AI explodes?". The Guardian. ISSN 0261-3077. Retrieved December 6, 2023.
  101. ^ "After WormGPT, FraudGPT Emerges to Help Scammers Steal Your Data". PCMAG. Retrieved July 31, 2023.
  102. ^ Gupta, Maanak; Akiri, Charankumar; Aryal, Kshitiz; Parker, Eli; Praharaj, Lopamudra (2023). "From ChatGPT to ThreatGPT: Impact of Generative AI in Cybersecurity and Privacy | IEEE Journals & Magazine | IEEE Xplore". IEEE Access. 11: 80218–80245. doi:10.1109/ACCESS.2023.3300381. S2CID 259316122. Retrieved November 17, 2023.
  103. ^ Roth, Emma (January 25, 2023). "CNET found errors in more than half of its AI-written stories". The Verge. Retrieved June 17, 2023.
  104. ^ "A magazine touted Michael Schumacher's first interview in years. It was actually AI". NPR. April 28, 2023. Retrieved June 17, 2023.
  105. ^ "ChatGPT and the EU AI Act". Mason Hayes & Curran. Retrieved November 29, 2023.
  106. ^ Chee, Foo Yun; Mukherjee, Supantha (June 14, 2023). "EU lawmakers vote for tougher AI rules as draft moves to final stage". Reuters. Retrieved July 26, 2023.
  107. ^ Bartz, Diane; Hu, Krystal (July 21, 2023). "OpenAI, Google, others pledge to watermark AI content for safety, White House says". Reuters.
  108. ^ Ye, Josh (July 13, 2023). "China says generative AI rules to apply only to products for the public". Reuters. Retrieved July 13, 2023.
  109. ^ "生成式人工智能服务管理暂行办法". July 13, 2023.