PJFP.com

Pursuit of Joy, Fulfillment, and Purpose

Tag: OpenAI

  • Custom Instructions for ChatGPT: A Deeper Dive into its Implications and Set-Up Process


    TL;DR

    OpenAI has introduced custom instructions for ChatGPT, allowing users to set preferences and requirements to personalize interactions. This is beneficial in diverse areas such as education, programming, and everyday tasks. The feature, still in beta, can be accessed by opting into ‘Custom Instructions’ under ‘Beta Features’ in the settings. OpenAI has also updated its safety measures and privacy policy to handle the new feature.


    As Artificial Intelligence continues to evolve, the demand for personalized and controlled interactions grows. OpenAI’s introduction of custom instructions for ChatGPT reflects a significant stride towards achieving this. By allowing users to set preferences and requirements, OpenAI enhances user interaction and ensures that ChatGPT remains efficient and effective in catering to unique needs.

    The Promise of Custom Instructions

    By analyzing and adhering to user-provided instructions, ChatGPT eliminates the necessity of repeatedly entering the same preferences or requirements, thereby significantly streamlining the user experience. This feature proves particularly beneficial in fields such as education, programming, and even everyday tasks like grocery shopping.

    In education, teachers can set preferences to optimize lesson planning, catering to specific grades and subjects. Meanwhile, developers can instruct ChatGPT to generate efficient code in a non-Python language. For grocery shopping, the model can tailor suggestions for a large family, saving the user time and effort.

    Beyond individual use, this feature can also enhance plugin experiences. By sharing relevant information with the plugins you use, ChatGPT can offer personalized services, such as restaurant suggestions based on your specified location.

    The Set-Up Process

    Plus plan users can access this feature by opting into the beta for custom instructions. On the web, navigate to your account settings, select ‘Beta Features,’ and opt into ‘Custom Instructions.’ For iOS, go to Settings, select ‘New Features,’ and turn on ‘Custom Instructions.’

    While it’s a promising step towards advanced steerability, it’s vital to note that ChatGPT may not always interpret custom instructions perfectly. Misinterpretations and overlooks may occur, especially during the beta period.

    Safety and Privacy

    OpenAI has also adapted its safety measures to account for this new feature. Its Moderation API is designed to ensure instructions that violate the Usage Policies are not saved. The model can refuse or ignore instructions that would lead to responses violating usage policies.

    Custom instructions might be used to improve the model performance across users. However, OpenAI ensures to remove any personal identifiers before these are utilized to improve the model performance. Users can disable this through their data controls, demonstrating OpenAI’s commitment to privacy and data protection.

    The launch of custom instructions for ChatGPT marks a significant advancement in the development of AI, one that pushes us closer to a world of personalized and efficient AI experiences.

  • Leveraging Efficiency: The Promise of Compact Language Models

    Leveraging Efficiency: The Promise of Compact Language Models

    In the world of artificial intelligence chatbots, the common mantra is “the bigger, the better.”

    Large language models such as ChatGPT and Bard, renowned for generating authentic, interactive text, progressively enhance their capabilities as they ingest more data. Daily, online pundits illustrate how recent developments – an app for article summaries, AI-driven podcasts, or a specialized model proficient in professional basketball questions – stand to revolutionize our world.

    However, developing such advanced AI demands a level of computational prowess only a handful of companies, including Google, Meta, OpenAI, and Microsoft, can provide. This prompts concern that these tech giants could potentially monopolize control over this potent technology.

    Further, larger language models present the challenge of transparency. Often termed “black boxes” even by their creators, these systems are complicated to decipher. This lack of clarity combined with the fear of misalignment between AI’s objectives and our own needs, casts a shadow over the “bigger is better” notion, underscoring it as not just obscure but exclusive.

    In response to this situation, a group of burgeoning academics from the natural language processing domain of AI – responsible for linguistic comprehension – initiated a challenge in January to reassess this trend. The challenge urged teams to construct effective language models utilizing data sets that are less than one-ten-thousandth of the size employed by the top-tier large language models. This mini-model endeavor, aptly named the BabyLM Challenge, aims to generate a system nearly as competent as its large-scale counterparts but significantly smaller, more user-friendly, and better synchronized with human interaction.

    Aaron Mueller, a computer scientist at Johns Hopkins University and one of BabyLM’s organizers, emphasized, “We’re encouraging people to prioritize efficiency and build systems that can be utilized by a broader audience.”

    Alex Warstadt, another organizer and computer scientist at ETH Zurich, expressed that the challenge redirects attention towards human language learning, instead of just focusing on model size.

    Large language models are neural networks designed to predict the upcoming word in a given sentence or phrase. Trained on an extensive corpus of words collected from transcripts, websites, novels, and newspapers, they make educated guesses and self-correct based on their proximity to the correct answer.

    The constant repetition of this process enables the model to create networks of word relationships. Generally, the larger the training dataset, the better the model performs, as every phrase provides the model with context, resulting in a more intricate understanding of each word’s implications. To illustrate, OpenAI’s GPT-3, launched in 2020, was trained on 200 billion words, while DeepMind’s Chinchilla, released in 2022, was trained on a staggering trillion words.

    Ethan Wilcox, a linguist at ETH Zurich, proposed a thought-provoking question: Could these AI language models aid our understanding of human language acquisition?

    Traditional theories, like Noam Chomsky’s influential nativism, argue that humans acquire language quickly and effectively due to an inherent comprehension of linguistic rules. However, language models also learn quickly, seemingly without this innate understanding, suggesting that these established theories may need to be reevaluated.

    Wilcox admits, though, that language models and humans learn in fundamentally different ways. Humans are socially engaged beings with tactile experiences, exposed to various spoken words and syntaxes not typically found in written form. This difference means that a computer trained on a myriad of written words can only offer limited insights into our own linguistic abilities.

    However, if a language model were trained only on the vocabulary a young human encounters, it might interact with language in a way that could shed light on our own cognitive abilities.

    With this in mind, Wilcox, Mueller, Warstadt, and a team of colleagues launched the BabyLM Challenge, aiming to inch language models towards a more human-like understanding. They invited teams to train models on roughly the same amount of words a 13-year-old human encounters – around 100 million. These models would be evaluated on their ability to generate and grasp language nuances.

    Eva Portelance, a linguist at McGill University, views the challenge as a pivot from the escalating race for bigger language models towards more accessible, intuitive AI.

    Large industry labs have also acknowledged the potential of this approach. Sam Altman, the CEO of OpenAI, recently stated that simply increasing the size of language models wouldn’t yield the same level of progress seen in recent years. Tech giants like Google and Meta have also been researching more efficient language models, taking cues from human cognitive structures. After all, a model that can generate meaningful language with less training data could potentially scale up too.

    Despite the commercial potential of a successful BabyLM, the challenge’s organizers emphasize that their goals are primarily academic. And instead of a monetary prize, the reward lies in the intellectual accomplishment. As Wilcox puts it, the prize is “Just pride.”

  • AI Industry Pioneers Advocate for Consideration of Potential Challenges Amid Rapid Technological Progress

    AI Industry Pioneers Advocate for Consideration of Potential Challenges Amid Rapid Technological Progress

    On Tuesday, a collective of industry frontrunners plans to express their concern about the potential implications of artificial intelligence technology, which they have a hand in developing. They suggest that it could potentially pose significant challenges to society, paralleling the severity of pandemics and nuclear conflicts.

    The anticipated statement from the Center for AI Safety, a nonprofit organization, will call for a global focus on minimizing potential challenges from AI. This aligns it with other significant societal issues, such as pandemics and nuclear war. Over 350 AI executives, researchers, and engineers have signed this open letter.

    Signatories include chief executives from leading AI companies such as OpenAI’s Sam Altman, Google DeepMind’s Demis Hassabis, and Anthropic’s Dario Amodei.

    In addition, Geoffrey Hinton and Yoshua Bengio, two Turing Award-winning researchers for their pioneering work on neural networks, have signed the statement, along with other esteemed researchers. Yann LeCun, the third Turing Award winner, who leads Meta’s AI research efforts, had not signed as of Tuesday.

    This statement arrives amidst escalating debates regarding the potential consequences of artificial intelligence. Innovations in large language models, as employed by ChatGPT and other chatbots, have sparked concerns about the misuse of AI in spreading misinformation or possibly disrupting numerous white-collar jobs.

    While the specifics are not always elaborated, some in the field argue that unmitigated AI developments could lead to societal-scale disruptions in the not-so-distant future.

    Interestingly, these concerns are echoed by many industry leaders, placing them in the unique position of suggesting tighter regulations on the very technology they are working to develop and advance.

    In an attempt to address these concerns, Altman, Hassabis, and Amodei recently engaged in a conversation with President Biden and Vice President Kamala Harris on the topic of AI regulation. Following this meeting, Altman emphasized the importance of government intervention to mitigate the potential challenges posed by advanced AI systems.

    In an interview, Dan Hendrycks, executive director of the Center for AI Safety, suggested that the open letter represented a public acknowledgment from some industry figures who previously only privately expressed their concerns about potential risks associated with AI technology development.

    While some critics argue that current AI technology is too nascent to pose a significant threat, others contend that the rapid progress of AI has already exceeded human performance in some areas. These proponents believe that the emergence of “artificial general intelligence,” or AGI, an AI capable of performing a wide variety of tasks at or beyond human-level performance, may not be too far off.

    In a recent blog post, Altman, along with two other OpenAI executives, proposed several strategies to manage powerful AI systems responsibly. They proposed increased cooperation among AI developers, further technical research into large language models, and the establishment of an international AI safety organization akin to the International Atomic Energy Agency.

    Furthermore, Altman has endorsed regulations requiring the developers of advanced AI models to obtain a government-issued license.

    Earlier this year, over 1,000 technologists and researchers signed another open letter advocating for a six-month halt on the development of the largest AI models. They cited fears about an unregulated rush to develop increasingly powerful digital minds.

    The new statement from the Center for AI Safety is brief, aiming to unite AI experts who share general concerns about powerful AI systems, regardless of their views on specific risks or prevention strategies.

    Geoffrey Hinton, a high-profile AI expert, recently left his position at Google to openly discuss potential AI implications. The statement has since been circulated and signed by some employees at major AI labs.

    The recent increased use of AI chatbots for entertainment, companionship, and productivity, combined with the rapid advancements in the underlying technology, has amplified the urgency of addressing these concerns.

    Altman emphasized this urgency in his Senate subcommittee testimony, saying, “We want to work with the government to prevent [potential challenges].”

  • Meet Auto-GPT: The AI Game-Changer

    Meet Auto-GPT: The AI Game-Changer

    A game-changing AI agent called Auto-GPT has been making waves in the field of artificial intelligence. Developed by Toran Bruce Richards and released on March 30, 2023, Auto-GPT is designed to achieve goals set in natural language by breaking them into sub-tasks and using the internet and other tools autonomously. Utilizing OpenAI’s GPT-4 or GPT-3.5 APIs, it is among the first applications to leverage GPT-4’s capabilities for performing autonomous tasks.

    Revolutionizing AI Interaction

    Unlike interactive systems such as ChatGPT, which require manual commands for every task, Auto-GPT takes a more proactive approach. It assigns itself new objectives to work on with the aim of reaching a greater goal without the need for constant human input. Auto-GPT can execute responses to prompts to accomplish a goal, and in doing so, will create and revise its own prompts to recursive instances in response to new information.

    Auto-GPT manages short-term and long-term memory by writing to and reading from databases and files, handling context window length requirements with summarization. Additionally, it can perform internet-based actions such as web searching, web form, and API interactions unattended, and includes text-to-speech for voice output.

    Notable Capabilities

    Observers have highlighted Auto-GPT’s ability to iteratively write, debug, test, and edit code, with some even suggesting that this ability may extend to Auto-GPT’s own source code, enabling a degree of self-improvement. However, as its underlying GPT models are proprietary, Auto-GPT cannot modify them.

    Background and Reception

    The release of Auto-GPT comes on the heels of OpenAI’s GPT-4 launch on March 14, 2023. GPT-4, a large language model, has been widely praised for its substantially improved performance across various tasks. While GPT-4 itself cannot perform actions autonomously, red-team researchers found during pre-release safety testing that it could be enabled to perform real-world actions, such as convincing a TaskRabbit worker to solve a CAPTCHA challenge.

    A team of Microsoft researchers argued that GPT-4 “could reasonably be viewed as an early (yet still incomplete) version of an artificial general intelligence (AGI) system.” However, they also emphasized the system’s significant limitations.

    Auto-GPT, developed by Toran Bruce Richards, founder of video game company Significant Gravitas Ltd, became the top trending repository on GitHub shortly after its release and has repeatedly trended on Twitter since.

    Auto-GPT represents a significant breakthrough in artificial intelligence, demonstrating the potential for AI agents to perform autonomous tasks with minimal human input. While there are still limitations to overcome, Auto-GPT’s innovative approach to goal-setting and task management has set the stage for further advancements in the development of AGI systems.