Top Related Projects
🦜🔗 Build context-aware reasoning applications
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
Quick Overview
BabyAGI is an AI-powered task management system that uses OpenAI's language models to create, prioritize, and execute tasks. It demonstrates the potential of AI in automating and optimizing workflow processes, showcasing how AI can assist in breaking down complex goals into manageable tasks.
Pros
- Automates task creation and prioritization
- Demonstrates practical application of AI in workflow management
- Highly customizable and extensible
- Open-source, allowing for community contributions and improvements
Cons
- Requires OpenAI API key, which can be costly for extensive use
- May produce inconsistent or irrelevant tasks depending on the initial prompt
- Limited error handling and robustness
- Potential for recursive or infinite task creation if not properly managed
Code Examples
- Creating a task list:
task_list = TaskList()
initial_task = Task("Develop a marketing strategy for a new product")
task_list.add_task(initial_task)
- Executing the main loop:
baby_agi = BabyAGI(task_list, openai_api_key="your-api-key")
baby_agi.run(max_iterations=5)
- Customizing the execution agent:
class CustomExecutionAgent(ExecutionAgent):
def execute_task(self, task):
# Custom implementation for task execution
result = f"Executed task: {task.description}"
return result
baby_agi = BabyAGI(task_list, execution_agent=CustomExecutionAgent())
Getting Started
-
Clone the repository:
git clone https://github.com/yoheinakajima/babyagi.git cd babyagi
-
Install dependencies:
pip install -r requirements.txt
-
Set up your OpenAI API key:
export OPENAI_API_KEY='your-api-key-here'
-
Run the main script:
python babyagi.py
Competitor Comparisons
🦜🔗 Build context-aware reasoning applications
Pros of LangChain
- More comprehensive framework for building LLM-powered applications
- Extensive documentation and community support
- Wider range of integrations with various LLM providers and tools
Cons of LangChain
- Steeper learning curve due to its extensive features
- May be overkill for simpler projects or prototypes
- Requires more setup and configuration
Code Comparison
LangChain example:
from langchain import OpenAI, LLMChain, PromptTemplate
llm = OpenAI(temperature=0.9)
prompt = PromptTemplate(
input_variables=["product"],
template="What is a good name for a company that makes {product}?",
)
chain = LLMChain(llm=llm, prompt=prompt)
print(chain.run("colorful socks"))
BabyAGI example:
import openai
openai.api_key = "your-api-key-here"
response = openai.Completion.create(
engine="text-davinci-002",
prompt="What is a good name for a company that makes colorful socks?",
max_tokens=50
)
print(response.choices[0].text.strip())
The LangChain example demonstrates its structured approach with chains and templates, while BabyAGI shows a more straightforward implementation using the OpenAI API directly. LangChain offers more flexibility and abstraction, whereas BabyAGI is simpler but less feature-rich.
Pros of TaskMatrix
- Offers a more comprehensive task planning and execution system
- Integrates with external tools and APIs for enhanced functionality
- Provides a visual interface for task management and progress tracking
Cons of TaskMatrix
- More complex setup and configuration required
- Steeper learning curve for new users
- Potentially higher resource requirements due to additional features
Code Comparison
TaskMatrix:
class TaskMatrix:
def __init__(self):
self.tasks = []
self.tools = []
def add_task(self, task):
self.tasks.append(task)
def execute_tasks(self):
for task in self.tasks:
task.execute()
BabyAGI:
class BabyAGI:
def __init__(self):
self.task_list = []
def add_task(self, task):
self.task_list.append(task)
def run(self):
while self.task_list:
task = self.task_list.pop(0)
self.execute_task(task)
TaskMatrix offers a more structured approach with separate task and tool management, while BabyAGI provides a simpler implementation focused on task execution. TaskMatrix's code suggests greater flexibility in handling various task types and external tools, whereas BabyAGI's code emphasizes a straightforward task processing loop.
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Pros of JARVIS
- More comprehensive and feature-rich, offering a wider range of AI capabilities
- Better integration with various tools and platforms, enhancing versatility
- Stronger support and documentation from Microsoft
Cons of JARVIS
- More complex setup and configuration process
- Potentially higher resource requirements due to its extensive features
Code Comparison
JARVIS:
from jarvis import Jarvis
jarvis = Jarvis()
response = jarvis.process_command("Summarize the latest news")
print(response)
BabyAGI:
from babyagi import BabyAGI
baby_agi = BabyAGI()
task = "Summarize the latest news"
result = baby_agi.execute_task(task)
print(result)
Summary
JARVIS offers a more robust and feature-rich AI system with better integration capabilities, while BabyAGI provides a simpler, more focused approach to AI task execution. JARVIS may require more setup and resources, but it offers a wider range of functionalities. BabyAGI, on the other hand, is easier to get started with but may have limitations in terms of advanced features and integrations.
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Pros of AutoGPT
- More advanced and feature-rich, offering a wider range of capabilities
- Better user interface and interaction, including a web-based GUI
- Supports multiple AI models and has a more active development community
Cons of AutoGPT
- More complex to set up and use, requiring more configuration
- Higher computational requirements and potentially higher costs
- May be overkill for simpler tasks that BabyAGI can handle efficiently
Code Comparison
AutoGPT:
def start_interaction_loop(self):
# Initialize variables for the interaction loop
loop_count = 0
command_name = None
arguments = None
user_input = ""
BabyAGI:
def run_baby_agi(objective, initial_task, llm, execution_agent):
task_list = deque([])
task_id_counter = 0
task_list.append({"task_id": task_id_counter, "task_name": initial_task})
Both projects aim to create autonomous AI agents, but AutoGPT offers a more comprehensive solution with advanced features and better user interaction. However, this comes at the cost of increased complexity and resource requirements. BabyAGI, on the other hand, provides a simpler and more lightweight approach, making it easier to understand and implement for basic tasks.
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
Pros of AgentGPT
- User-friendly web interface for easy interaction and task management
- Supports multiple language models, including GPT-3.5 and GPT-4
- Offers a more comprehensive agent ecosystem with various agent types
Cons of AgentGPT
- More complex setup and configuration process
- Requires more computational resources due to its advanced features
- Less focused on specific task automation compared to BabyAGI
Code Comparison
BabyAGI:
def add_task(task: Dict):
task.update({"task_id": len(task_list) + 1})
task_list.append(task)
def get_ada_embedding(text):
text = text.replace("\n", " ")
return openai.Embedding.create(input=[text], model="text-embedding-ada-002")["data"][0]["embedding"]
AgentGPT:
const createAgent = async (goal: string, modelName: string) => {
const agent = new Agent(goal, modelName);
await agent.start();
return agent;
};
const executeTask = async (agent: Agent, task: string) => {
const result = await agent.executeTask(task);
return result;
};
Convert designs to code with AI
Introducing Visual Copilot: A new AI model to turn Figma designs to high quality code using your components.
Try Visual CopilotREADME
BabyAGI
[!NOTE] The original BabyAGI from March 2023 introduced task planning as a method for developing autonomous agents. This project has been archived and moved to the babyagi_archive repo (September 2024 snapshot).
[!CAUTION] This is a framework built by Yohei who has never held a job as a developer. The purpose of this repo is to share ideas and spark discussion and for experienced devs to play with. Not meant for production use. Use with cautioun.
This newest BabyAGI is an experimental framework for a self-building autonomous agent. Earlier efforts to expand BabyAGI have made it clear that the optimal way to build a general autonomous agent is to build the simplest thing that can build itself.
Check out this introductory X/Twitter thread for a simple overview.
The core is a new function framework (functionz) for storing, managing, and executing functions from a database. It offers a graph-based structure for tracking imports, dependent functions, and authentication secrets, with automatic loading and comprehensive logging capabilities. Additionally, it comes with a dashboard for managing functions, running updates, and viewing logs.
Table of Contents
- Quick Start
- Basic Usage
- Function Metadata
- Function Loading
- Key Dependencies
- Execution Environment
- Dashboard
- Pre-loaded Functions
- Future/Draft Features
- API Reference
- Contributing
- License
Quick Start
To quickly check out the dashboard and see how it works:
-
Install BabyAGI:
pip install babyagi
-
Import BabyAGI and load the dashboard:
import babyagi if __name__ == "__main__": app = babyagi.create_app('/dashboard') app.run(host='0.0.0.0', port=8080)
-
Navigate to the dashboard:
Open your browser and go to
http://localhost:8080/dashboard
to access the BabyAGI dashboard.
Basic Usage
Start by importing babyagi
and registering your functions. Here's how to register two functions, where one depends on the other:
import babyagi
# Register a simple function
@babyagi.register_function()
def world():
return "world"
# Register a function that depends on 'world'
@babyagi.register_function(dependencies=["world"])
def hello_world():
x = world()
return f"Hello {x}!"
# Execute the function
print(babyagi.hello_world()) # Output: Hello world!
if __name__ == "__main__":
app = babyagi.create_app('/dashboard')
app.run(host='0.0.0.0', port=8080)
Function Metadata
Functions can be registered with metadata to enhance their capabilities and manage their relationships. Here's a more comprehensive example of function metadata, showing logical usage of all fields:
import babyagi
@babyagi.register_function(
imports=["math"],
dependencies=["circle_area"],
key_dependencies=["openai_api_key"],
metadata={
"description": "Calculates the volume of a cylinder using the circle_area function."
}
)
def cylinder_volume(radius, height):
import math
area = circle_area(radius)
return area * height
Available Metadata Fields:
imports
: List of external libraries the function depends on.dependencies
: List of other functions this function depends on.key_dependencies
: List of secret keys required by the function.metadata["description"]
: A description of what the function does.
Function Loading
In addition to using register_function
, you can use load_function
to load plugins or draft packs of functions. BabyAGI comes with built-in function packs, or you can load your own packs by pointing to the file path.
You can find available function packs in babyagi/functionz/packs
.
Loading Custom Function Packs:
import babyagi
# Load your custom function pack
babyagi.load_functions("path/to/your/custom_functions.py")
This approach makes function building and management easier by organizing related functions into packs.
Key Dependencies
You can store key_dependencies
directly from your code or manage them via the dashboard.
Storing Key Dependencies from Code:
import babyagi
# Add a secret key
babyagi.add_key_wrapper('openai_api_key', 'your_openai_api_key')
Adding Key Dependencies via Dashboard:
Navigate to the dashboard and use the add_key_wrapper feature to securely add your secret keys.
Execution Environment
BabyAGI automatically loads essential function packs and manages their dependencies, ensuring a seamless execution environment. Additionally, it logs all activities, including the relationships between functions, to provide comprehensive tracking of function executions and dependencies.
Log
BabyAGI implements a comprehensive logging system to track all function executions and their interactions. The logging mechanism ensures that every function call, including its inputs, outputs, execution time, and any errors, is recorded for monitoring and debugging purposes.
Key Logging Features:
-
Execution Tracking: Logs when a function starts and finishes execution, including the function name, arguments, keyword arguments, and execution time.
-
Error Logging: Captures and logs any errors that occur during function execution, providing detailed error messages for troubleshooting.
-
Dependency Management: Automatically resolves and logs dependencies between functions, ensuring that all required functions and libraries are loaded before execution.
-
Trigger Logging: Logs the execution of triggered functions, detailing which functions were triggered by others and their respective execution outcomes.
-
Comprehensive Records: Maintains a history of all function executions, enabling users to review past activities, understand function relationships, and analyze performance metrics.
How Triggers Work:
Triggers are mechanisms that allow certain functions to be automatically executed in response to specific events or actions within the system. For example, when a function is added or updated, a trigger can initiate the generation of a description for that function.
Triggers enhance the autonomy of BabyAGI by enabling automated workflows and reducing the need for manual intervention. However, it's essential to manage triggers carefully to avoid unintended recursive executions or conflicts between dependent functions.
Dashboard
The BabyAGI dashboard offers a user-friendly interface for managing functions, monitoring executions, and handling configurations. Key features include:
-
Function Management: Register, deregister, and update functions directly from the dashboard.
-
Dependency Visualization: View and manage dependencies between functions to understand their relationships.
-
Secret Key Management: Add and manage secret keys securely through the dashboard interface.
-
Logging and Monitoring: Access comprehensive logs of function executions, including inputs, outputs, and execution times.
-
Trigger Management: Set up triggers to automate function executions based on specific events or conditions.
Accessing the Dashboard:
After running your application, navigate to http://localhost:8080/dashboard
to access the BabyAGI dashboard.
Pre-loaded Functions Summary
BabyAGI includes two pre-loaded function packs:
-
Default Functions (
packs/default_functions.py
):- Function Execution: Run, add, update, or retrieve functions and versions.
- Key Management: Add and retrieve secret keys.
- Triggers: Add triggers to execute functions based on others.
- Logs: Retrieve logs with optional filters.
-
AI Functions (
packs/ai_generator.py
):- AI Description & Embeddings: Auto-generate descriptions and embeddings for functions.
- Function Selection: Find or choose similar functions based on prompts.
Running a Self-Building Agent
BabyAGI includes two experimental self-building agents, showcasing how the framework can help a self-building coding agent leverage existing functions to write new ones.
1. process_user_input
in the code_writing_functions
pack
This function first determines whether to use an existing function or generate new ones. If new functions are needed, it breaks them down into smaller reusable components and combines them into a final function.
Try this:
import babyagi
babyagi.add_key_wrapper('openai_api_key', os.environ['OPENAI_API_KEY'])
babyagi.load_functions("drafts/code_writing_functions")
babyagi.process_user_input("Grab today's score from ESPN and email it to test@test.com")
When you run this, you will see the functions being generated in the shell and new functions will be available in the dashboard once completed.
2. self_build
in the self_build
pack
This function takes a user description and generates X distinct tasks that a user might ask an AI assistant. Each task is processed by process_user_input
, creating new functions if no existing ones suffice.
Try this:
import babyagi
babyagi.add_key_wrapper('openai_api_key', os.environ['OPENAI_API_KEY'])
babyagi.load_functions("drafts/code_writing_functions")
babyagi.load_functions("drafts/self_build")
babyagi.self_build("A sales person at an enterprise SaaS company.", 3)
This will generate 3 distinct tasks a salesperson might ask an AI assistant and create functions to handle those.
*The functions will be generated and stored in the dashboard, but note that the generated code is minimal and may need improvement.
Warning: These draft features are experimental concepts and may not function as intended. They require significant improvements and should be used with caution.
Contributing
Contributions are greatly appreciatedly, but candidly I have not been great at managing PRs. Please be patient as things will move slow while I am working on this alone (on nights and weekends). I may start by building a small core crew before collaborating with a larger group.
If you are a dev, investor, friend of open-source and interesting supporting AI work I do, please fill this form (I have a few fun initiatives coming up!)
License
BabyAGI is released under the MIT License. See the LICENSE file for more details.
Top Related Projects
🦜🔗 Build context-aware reasoning applications
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
Convert designs to code with AI
Introducing Visual Copilot: A new AI model to turn Figma designs to high quality code using your components.
Try Visual Copilot