Skip to content

Releases: microsoft/genaiscript

v1.93.0

16 Jan 23:36
Compare
Choose a tag to compare

What's New in GenAIScript 1.93.0

🚀 Keyframe Extraction and Scene Detection
Introducing enhanced video processing capabilities! Extract keyframes or detect scenes with adjustable thresholds using the new --keyframes and --scene-threshold options in the CLI.

🗣️ Local Whisper ASR Support
Seamlessly integrate the Whisper ASR Webservice for transcription tasks. Configure it as a model provider and enjoy local transcription support.

📚 Improved Notebook Execution
Notebook frontmatter now supports additional run options like models and providers, enabling more flexible and powerful script execution.

🔧 Enhanced FFmpeg Integration
Optimized frame extraction with wildcard-based output handling, scene-specific frame selection, and improved caching for smoother workflows.

🐛 Bug Fixes

  • Resolved issues with parsing notebook frontmatter.
  • Fixed minor inconsistencies in transcription and frame extraction logic.

v1.92.3

16 Jan 18:09
Compare
Choose a tag to compare

What's New in GenAIScript 1.92.3

🚀 Enhanced Video and Audio Processing

  • Added probe command to extract detailed metadata from video/audio files.
  • Improved extract-audio and extract-frames commands with new options for transcription, frame count, size, and format.

🎥 Large Media Support

  • Optimized frame extraction for large images and videos, ensuring better performance and flexibility.

📜 Transcription Improvements

  • Revamped transcription caching and error handling for seamless speech-to-text conversion.

⚙️ FFmpeg Integration

  • Introduced FFmpeg feature installation in dev environments for enhanced media processing.

🛠️ Developer Experience

  • Simplified CLI workflows and added comprehensive tests for video processing tasks.

Enjoy smoother media handling and powerful new tools!

v1.92.2

15 Jan 23:33
Compare
Choose a tag to compare

What's New in 1.92.2

🚀 Enhanced Webview Handling
Improved handling of external URIs for webview assets, ensuring seamless integration and better external resource management.

🎥 Video Processing Improvements

  • Added support for caching during audio extraction with FFmpeg, optimizing repeated operations.
  • Updated video transcription to leverage caching for alt-text generation.

📂 Cleaner Repositories
Introduced .gitignore for large local test video files, keeping repositories tidy.

🛠️ Bug Fixes & Optimizations

  • Resolved issues with optional parameters in FFmpeg operations.
  • Adjusted input handling to ensure consistent directory management.

v1.92.1

15 Jan 22:25
Compare
Choose a tag to compare

Enhanced FFmpeg Flexibility & Performance

  • Introduced support for custom FFmpeg options, allowing more control over video and audio processing.
  • Improved caching mechanism for FFmpeg operations, reducing redundant computations and enhancing efficiency.

🎥 New Video Processing Features

  • Added new methods to the FFmpeg command builder, including videoBitrate, outputFps, size, autopad, and more.
  • Enabled advanced video transformations, such as aspect ratio adjustments, padding, and grayscale conversion.

🎵 Streamlined Audio Extraction

  • Refined audio extraction with customizable output options for better quality control.

🛠 Developer Experience Improvements

  • Simplified FFmpeg command builder API for easier and more intuitive usage.
  • Improved type definitions for better IDE support and code clarity.

This release unlocks powerful media processing capabilities and optimizes workflows for developers. 🚀

v1.92.0

15 Jan 21:16
Compare
Choose a tag to compare

What's New in GenAIScript 1.92.0

  • 🚀 Enhanced Video Processing: Introduced a robust FFmepgClient for seamless video and audio operations, including frame extraction, audio track extraction, and metadata probing. Simplified API for working with video files.
  • 🎨 Improved Output Logging: Added support for audio logging and markdown table rendering in OutputTrace, making it easier to visualize results.
  • 🌐 Remote Environment Support: Enhanced server to display remote repository details, including URL and branch, for better traceability.
  • Mermaid Diagrams: Added support for rendering Mermaid diagrams in markdown outputs, enabling richer visualizations.
  • 🛠️ VS Code Integration: New webview support for viewing GenAIScript requests directly in VS Code.
  • 🏗️ File Search Flexibility: findFiles now supports an ignore option with array input for more granular file matching.
  • 🗂️ Annotation Improvements: Enhanced annotation handling with better formatting and GitHub Action command integration.

This release brings significant upgrades for video processing, logging, and developer experience.

v1.91.2

14 Jan 22:29
Compare
Choose a tag to compare
  • 🚀 Enhanced Data Handling: Introduced a unified dataTryParse function for seamless parsing of various data formats, improving flexibility and reducing redundancy.
  • 🛠️ New Tools: Added fs_infer_schema and fs_data_query tools to infer JSON schemas and query data using GROQ syntax, empowering data analysis capabilities.
  • 📂 Data Agent: Launched a new "data" agent to analyze and query file data, equipped with schema inference and querying tools.
  • 🔒 Safety Improvements: Integrated content safety checks for harmful and protected material in agents and tools.
  • 🎥 Video Agent Update: Enhanced the video agent with additional safety checks.
  • 📄 Teams Integration: Added functionality to read messages from Microsoft Teams channels, complementing the existing message posting feature.
  • 🧪 Sample Scripts: Introduced new sample scripts demonstrating data schema inference and multi-file data analysis.

v1.91.1

14 Jan 20:49
Compare
Choose a tag to compare

Enhanced Video Capabilities
Introducing powerful video processing tools! Transcribe audio/video files with speech-to-text, extract frames, probe metadata, and even extract audio from video files. Perfect for multimedia projects!

🌐 WebSocket Improvements
WebSocket server now supports dispatching progress updates to all clients, enhancing real-time collaboration and monitoring.

🎤 Speech-to-Audio Support
Added speech synthesis capabilities, enabling text-to-speech generation with customizable voices for a variety of use cases.

🛠️ Developer Utilities
New deleteEmptyValues utility improves object handling, and CLI enhancements include options for dispatching progress events.

🚀 Performance and Stability
Refactored internal structures and fixed minor issues, ensuring a smoother and more reliable experience.

v1.91.0

13 Jan 16:55
Compare
Choose a tag to compare

🚀 GenAIScript 1.91.0 Highlights

🌟 New Features & Improvements

  • Enhanced Embedding Configurations: Added new embedding aliases for streamlined LLM setups.
  • Streaming Output: Introduced robust real-time output streaming for better user feedback.
  • Trace-Driven Output: Improved trace capabilities, enabling detailed output tracking and customization.

🛠️ Quality Enhancements

  • Model Management: Simplified model pulling with expanded support for LMStudio and enhanced alias handling.
  • Video Config & Fetch Updates: Improved video configuration documentation and fixed fetch command formatting for better usability.
  • Error Handling: Enhanced error serialization for clearer debugging.

🔧 Developer Experience

  • Refined APIs: Modularized chat message collapsing and moved it to a dedicated rendering module.
  • Expanded Trace API: Added new output trace capabilities, including detailed logging and structured output support.

🧹 Maintenance

  • Code Cleanup: Removed unused code paths, updated aliases, and optimized internal logic for better performance and readability.

v1.90.1

10 Jan 21:22
Compare
Choose a tag to compare

🎉 Enhanced Video Utilities: Added support for WorkspaceFile in video processing functions, enabling seamless handling of various file types. Extracting audio, frames, and metadata from videos is now more robust and flexible.

Improved Fetch Implementation: Integrated cross-fetch for better compatibility and enhanced proxy support. Added detailed request tracing, including support for multipart form data, to simplify debugging.

🔧 Smarter Configuration Handling: Configuration merging is now more intuitive, with support for arrays and object properties like modelAliases. This update improves flexibility and reduces potential errors.

🐛 Bug Fixes: Resolved issues with error handling, including a fix for misreported error messages. Enhanced the reliability of OpenAI API interactions and transcription workflows.

📂 Sample Configurations: Introduced sample genaiscript.config.yml and large.genai.mjs files to demonstrate configuration and scripting capabilities.

v1.90.0

10 Jan 17:40
Compare
Choose a tag to compare

New Features and Enhancements

  • Added CLI commands for video processing:
    • Extract audio from video files.
    • Extract video frames with options for count, size, and output folder.
  • Integrated HuggingFace's pipeline API for advanced transformer capabilities.
  • Introduced transcription support via OpenAI's Whisper API, enabling audio-to-text conversion with caching and SRT/VTT format generation.
  • Enhanced data slicing in defData to support object field sampling and filtering.

Performance Improvements

  • Optimized hashing with streaming file support and salt integration.
  • Improved concurrency handling for video frame extraction and audio transcoding using FFmpeg.

🛠️ Bug Fixes and Stability

  • Fixed edge cases in file handling for workspace paths.
  • Resolved hashing inconsistencies for various data types like buffers and blobs.
  • Enhanced error handling and logging for video processing and transcription workflows.

🎥 Video and Audio Processing

  • Seamless integration of FFmpeg for video/audio tasks.
  • Added caching for video probes, audio extraction, and frame generation.

💡 Developer Experience

  • Simplified runtime configuration for transcription and video utilities.
  • Improved CLI usability with detailed descriptions and argument validations.