Releases: microsoft/genaiscript
v1.93.0
What's New in GenAIScript 1.93.0
🚀 Keyframe Extraction and Scene Detection
Introducing enhanced video processing capabilities! Extract keyframes or detect scenes with adjustable thresholds using the new --keyframes
and --scene-threshold
options in the CLI.
🗣️ Local Whisper ASR Support
Seamlessly integrate the Whisper ASR Webservice for transcription tasks. Configure it as a model provider and enjoy local transcription support.
📚 Improved Notebook Execution
Notebook frontmatter now supports additional run options like models and providers, enabling more flexible and powerful script execution.
🔧 Enhanced FFmpeg Integration
Optimized frame extraction with wildcard-based output handling, scene-specific frame selection, and improved caching for smoother workflows.
🐛 Bug Fixes
- Resolved issues with parsing notebook frontmatter.
- Fixed minor inconsistencies in transcription and frame extraction logic.
v1.92.3
What's New in GenAIScript 1.92.3
🚀 Enhanced Video and Audio Processing
- Added
probe
command to extract detailed metadata from video/audio files. - Improved
extract-audio
andextract-frames
commands with new options for transcription, frame count, size, and format.
🎥 Large Media Support
- Optimized frame extraction for large images and videos, ensuring better performance and flexibility.
📜 Transcription Improvements
- Revamped transcription caching and error handling for seamless speech-to-text conversion.
⚙️ FFmpeg Integration
- Introduced FFmpeg feature installation in dev environments for enhanced media processing.
🛠️ Developer Experience
- Simplified CLI workflows and added comprehensive tests for video processing tasks.
Enjoy smoother media handling and powerful new tools!
v1.92.2
What's New in 1.92.2
🚀 Enhanced Webview Handling
Improved handling of external URIs for webview assets, ensuring seamless integration and better external resource management.
🎥 Video Processing Improvements
- Added support for caching during audio extraction with FFmpeg, optimizing repeated operations.
- Updated video transcription to leverage caching for alt-text generation.
📂 Cleaner Repositories
Introduced .gitignore
for large local test video files, keeping repositories tidy.
🛠️ Bug Fixes & Optimizations
- Resolved issues with optional parameters in FFmpeg operations.
- Adjusted input handling to ensure consistent directory management.
v1.92.1
✨ Enhanced FFmpeg Flexibility & Performance
- Introduced support for custom FFmpeg options, allowing more control over video and audio processing.
- Improved caching mechanism for FFmpeg operations, reducing redundant computations and enhancing efficiency.
🎥 New Video Processing Features
- Added new methods to the FFmpeg command builder, including
videoBitrate
,outputFps
,size
,autopad
, and more. - Enabled advanced video transformations, such as aspect ratio adjustments, padding, and grayscale conversion.
🎵 Streamlined Audio Extraction
- Refined audio extraction with customizable output options for better quality control.
🛠 Developer Experience Improvements
- Simplified FFmpeg command builder API for easier and more intuitive usage.
- Improved type definitions for better IDE support and code clarity.
This release unlocks powerful media processing capabilities and optimizes workflows for developers. 🚀
v1.92.0
What's New in GenAIScript 1.92.0
- 🚀 Enhanced Video Processing: Introduced a robust
FFmepgClient
for seamless video and audio operations, including frame extraction, audio track extraction, and metadata probing. Simplified API for working with video files. - 🎨 Improved Output Logging: Added support for audio logging and markdown table rendering in
OutputTrace
, making it easier to visualize results. - 🌐 Remote Environment Support: Enhanced server to display remote repository details, including URL and branch, for better traceability.
- ✨ Mermaid Diagrams: Added support for rendering Mermaid diagrams in markdown outputs, enabling richer visualizations.
- 🛠️ VS Code Integration: New webview support for viewing GenAIScript requests directly in VS Code.
- 🏗️ File Search Flexibility:
findFiles
now supports anignore
option with array input for more granular file matching. - 🗂️ Annotation Improvements: Enhanced annotation handling with better formatting and GitHub Action command integration.
This release brings significant upgrades for video processing, logging, and developer experience.
v1.91.2
- 🚀 Enhanced Data Handling: Introduced a unified
dataTryParse
function for seamless parsing of various data formats, improving flexibility and reducing redundancy. - 🛠️ New Tools: Added
fs_infer_schema
andfs_data_query
tools to infer JSON schemas and query data using GROQ syntax, empowering data analysis capabilities. - 📂 Data Agent: Launched a new "data" agent to analyze and query file data, equipped with schema inference and querying tools.
- 🔒 Safety Improvements: Integrated content safety checks for harmful and protected material in agents and tools.
- 🎥 Video Agent Update: Enhanced the video agent with additional safety checks.
- 📄 Teams Integration: Added functionality to read messages from Microsoft Teams channels, complementing the existing message posting feature.
- 🧪 Sample Scripts: Introduced new sample scripts demonstrating data schema inference and multi-file data analysis.
v1.91.1
✨ Enhanced Video Capabilities
Introducing powerful video processing tools! Transcribe audio/video files with speech-to-text, extract frames, probe metadata, and even extract audio from video files. Perfect for multimedia projects!
🌐 WebSocket Improvements
WebSocket server now supports dispatching progress updates to all clients, enhancing real-time collaboration and monitoring.
🎤 Speech-to-Audio Support
Added speech synthesis capabilities, enabling text-to-speech generation with customizable voices for a variety of use cases.
🛠️ Developer Utilities
New deleteEmptyValues
utility improves object handling, and CLI enhancements include options for dispatching progress events.
🚀 Performance and Stability
Refactored internal structures and fixed minor issues, ensuring a smoother and more reliable experience.
v1.91.0
🚀 GenAIScript 1.91.0 Highlights
🌟 New Features & Improvements
- Enhanced Embedding Configurations: Added new embedding aliases for streamlined LLM setups.
- Streaming Output: Introduced robust real-time output streaming for better user feedback.
- Trace-Driven Output: Improved trace capabilities, enabling detailed output tracking and customization.
🛠️ Quality Enhancements
- Model Management: Simplified model pulling with expanded support for LMStudio and enhanced alias handling.
- Video Config & Fetch Updates: Improved video configuration documentation and fixed fetch command formatting for better usability.
- Error Handling: Enhanced error serialization for clearer debugging.
🔧 Developer Experience
- Refined APIs: Modularized chat message collapsing and moved it to a dedicated rendering module.
- Expanded Trace API: Added new output trace capabilities, including detailed logging and structured output support.
🧹 Maintenance
- Code Cleanup: Removed unused code paths, updated aliases, and optimized internal logic for better performance and readability.
v1.90.1
🎉 Enhanced Video Utilities: Added support for WorkspaceFile
in video processing functions, enabling seamless handling of various file types. Extracting audio, frames, and metadata from videos is now more robust and flexible.
✨ Improved Fetch Implementation: Integrated cross-fetch
for better compatibility and enhanced proxy support. Added detailed request tracing, including support for multipart form data, to simplify debugging.
🔧 Smarter Configuration Handling: Configuration merging is now more intuitive, with support for arrays and object properties like modelAliases
. This update improves flexibility and reduces potential errors.
🐛 Bug Fixes: Resolved issues with error handling, including a fix for misreported error messages. Enhanced the reliability of OpenAI API interactions and transcription workflows.
📂 Sample Configurations: Introduced sample genaiscript.config.yml
and large.genai.mjs
files to demonstrate configuration and scripting capabilities.
v1.90.0
✨ New Features and Enhancements
- Added CLI commands for video processing:
- Extract audio from video files.
- Extract video frames with options for count, size, and output folder.
- Integrated HuggingFace's
pipeline
API for advanced transformer capabilities. - Introduced transcription support via OpenAI's Whisper API, enabling audio-to-text conversion with caching and SRT/VTT format generation.
- Enhanced data slicing in
defData
to support object field sampling and filtering.
⚡ Performance Improvements
- Optimized hashing with streaming file support and salt integration.
- Improved concurrency handling for video frame extraction and audio transcoding using FFmpeg.
🛠️ Bug Fixes and Stability
- Fixed edge cases in file handling for workspace paths.
- Resolved hashing inconsistencies for various data types like buffers and blobs.
- Enhanced error handling and logging for video processing and transcription workflows.
🎥 Video and Audio Processing
- Seamless integration of FFmpeg for video/audio tasks.
- Added caching for video probes, audio extraction, and frame generation.
💡 Developer Experience
- Simplified runtime configuration for transcription and video utilities.
- Improved CLI usability with detailed descriptions and argument validations.