Releases: kyolee415/generative-ai-docs
Releases · kyolee415/generative-ai-docs
0.4.2
What's changed
- Enhanced tooling integration: Introduced the
agent toolscommand for
seamless integration with external tools via MCP. - Core system updates: Significant updates were made to model, embedding,
and RAG handling. This includes refactoring model creation, updating embedding
models, adding Chroma backups, and supporting Gemini 2.5 multimodal models. - Flexible task execution: Docs Agent tasks can now execute custom scripts
directly using the newscriptfunction withinruntask. - Improved input handling: The CLI and task files now offer enhanced support
for specifying multiple input files. Use either repeated--fileflags or
a--list_fileflag pointing to a file list.
0.4.1
0.4.0
What's changed
- Multi-modal support: The Docs Agent CLI supports image, audio, and video files as part of a prompt to the Gemini model.
- Formatted output: Select the format of Docs Agent CLI's responses with the
--response_type jsonand--plaintextoptions. - Autocomplete script: The
autocomplete.shscript is added to include Docs Agent CLI commands and options, making it easier and faster to use the Docs Agent CLI on a terminal.
0.3.6
What's changed
- Runtask enhancement: The
runtaskcommand allows specifying a custom input string using the--custom_inputflag. - ExtractWorkflows task: This new task extracts workflows and commandlines from a source document.
- HelpPolishPrompts task: This task helps users polish prompts in a task file.
- IndexPageGenerator task: This task generates an index page describing files in a given directory.
- Chat app bug fix: Fixed an issue where + characters were removed from the page title.
0.3.5
What's changed
agent runtaskcommand: Run tasks defined in.yamlfiles, for example, generating release notes based on commit messages.- Session Management: Start and maintain sessions using
--newand--contflags with thehelpmeandtellmecommands. - CLI Command Refactoring: The CLI is more modular and easier to extend with new commands.
- Web App Mode Updates: A new
widget-promode is added as an iFrame-friendly template. - Bug Fixes: Various bug fixes related to flags, UI, and file handling.
0.3.4
What's changed
- Enhanced file processing: Process an entire directory with the
agent helpmecommand and export results to aresponses.yamlfile. - Improved session control: Manage conversation context using the
--newand--contflags withagent helpme. - Debugging tools: Capture detailed logs for troubleshooting and export them to CSV with the new
agent write-logs-to-csvcommand. - Documentation expansion: Explore updated configuration references, recent release notes, and a new concept doc on chunking.
- Bug fixes: Added fixes for the FIDL splitter, file handling, and blank results, and updates to various README files.
0.3.3
What's changed
- Added a new
agent populatefeature to delete existing chunks
in databases if they are no longer found in source dataset. - Updated the
agent chunkcommand to make sure text chunks
are not bigger than 5 KB for generating embeddings. - Added a feature to display the distribution of chunk sizes
after runningagent chunk. - Added more
agentcommandlines and options. - Updated the "Rewrite" button to the "Feedback" button by default.
- Bug fixes.
0.3.2
What's changed
- Support the new Gemini 1.5 models in preview.
- Add new experimental CLI features to interact with Gemini models directly from a Linux terminal:
agent tellmeandagent helpme. - Better handle uploading of text chunks using the Semantic Retrieval API.
- Add a new chat UI feature to provide a page for viewing logs.
- Update the Google Generative AI SDK version to
0.5.0. - Refactor the pre-preprocessing module (in progress).
- Remove unused code and type mismatch errors.
- Bug fixes
0.3.1
0.3.0
What's changed
- Refactoring of Docs Agent to support a more modularized architecture.
- Added features:
- Benchmarking: Includes benchmark test to measure the quality of AI generated responses.
- Semantic Retrieval API: Supports using Gemini's Semantic Retrieval API
to store text chunks online and the AQA model for question-answering. - Docs Agent CLI: Allows easy management of chunking documents,
populating a vector database, launching the chatbot and more.