blip-2-vision-language
π―Skillfrom orchestra-research/ai-research-skills
Performs advanced visual question answering and image captioning using the BLIP-2 multimodal AI model for understanding image-text relationships.
Part of
orchestra-research/ai-research-skills(84 items)
Installation
npx @orchestra-research/ai-research-skillsnpx @orchestra-research/ai-research-skills list # View installed skillsnpx @orchestra-research/ai-research-skills update # Update installed skills/plugin marketplace add orchestra-research/AI-research-SKILLs/plugin install fine-tuning@ai-research-skills # Axolotl, LLaMA-Factory, PEFT, Unsloth+ 4 more commands
More from this repository10
Streamlines AI research workflows by providing curated Claude skills for data analysis, literature review, experiment design, and research paper generation.
Assists AI researchers in drafting, structuring, and generating machine learning research papers with academic writing best practices and technical precision.
Streamlines distributed data processing and machine learning workflows using Ray's scalable data loading and transformation capabilities.
Streamlines distributed machine learning training using Ray, optimizing hyperparameter tuning and parallel model execution across compute clusters.
Tokenize and encode text using advanced subword segmentation for multilingual NLP tasks and machine translation models
Trains compact language models with minimal compute, enabling efficient text generation and fine-tuning on small datasets using PyTorch and transformer architectures.
Streamlines reinforcement learning model training in PyTorch with automated hyperparameter tuning, environment setup, and advanced policy optimization techniques.
Validates and sanitizes AI prompts to prevent injection attacks, filter sensitive content, and ensure safe, controlled interactions with language models.
Implements and evaluates RWKV language model architectures, providing tools for training, fine-tuning, and performance analysis of linear attention transformer alternatives.
Perform targeted neural network interventions and activation manipulations to probe model behavior and understand internal representations in PyTorch.