Instructions to use LLM360/CrystalChat-7B-Web2Code with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use LLM360/CrystalChat-7B-Web2Code with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="LLM360/CrystalChat-7B-Web2Code", trust_remote_code=True)# Load model directly from transformers import AutoModelForCausalLM model = AutoModelForCausalLM.from_pretrained("LLM360/CrystalChat-7B-Web2Code", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use LLM360/CrystalChat-7B-Web2Code with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "LLM360/CrystalChat-7B-Web2Code" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "LLM360/CrystalChat-7B-Web2Code", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/LLM360/CrystalChat-7B-Web2Code
- SGLang
How to use LLM360/CrystalChat-7B-Web2Code with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "LLM360/CrystalChat-7B-Web2Code" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "LLM360/CrystalChat-7B-Web2Code", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "LLM360/CrystalChat-7B-Web2Code" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "LLM360/CrystalChat-7B-Web2Code", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use LLM360/CrystalChat-7B-Web2Code with Docker Model Runner:
docker model run hf.co/LLM360/CrystalChat-7B-Web2Code
Update README.md
Browse files
README.md
CHANGED
|
@@ -19,6 +19,24 @@ datasets:
|
|
| 19 |
CrystalChat-7B based multi-modal large language model (MLLM) mimics the training recipe used for Vicuna-7B based [LLaVa-v1.5](https://huggingface.co/docs/transformers/main/model_doc/llava). CrystalChat-7B based MLLMs models are entirely transparent, having open-sourced all materials, including code, data, model checkpoint, intermediate results, and more at [Web2Code: A Large-scale Webpage-to-Code Dataset
|
| 20 |
and Evaluation Framework for Multimodal LLMs](https://arxiv.org/pdf/2406.20098). CrystalChat-7B-Web2Code MLLM is specialized in webpage images-to-html code generation.
|
| 21 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 22 |
## Web2Code Dataset
|
| 23 |
Our Web2Code instruction tuning dataset construction and instruction generation process
|
| 24 |
involves four key components:
|
|
@@ -142,29 +160,6 @@ The dataset chosen was created by LLaVA with academic-task-oriented VQA data mix
|
|
| 142 |
|
| 143 |
**Table 6:** Distribution of DWU and DWU<sub>R</sub> datasets. Both datasets include high-quality question-answer pairs for webpage understanding.*
|
| 144 |
|
| 145 |
-
|
| 146 |
-
|
| 147 |
-
## Examples
|
| 148 |
-
|
| 149 |
-
|
| 150 |
-
|
| 151 |
-
**Example 1: Hand drawn images**
|
| 152 |
-
|
| 153 |
-
|  |  |
|
| 154 |
-
|:----------------------:|:----------------------:|
|
| 155 |
-
| Hand Drawn Webpage | CrystalChat-Web2Code Rendering |
|
| 156 |
-
|
| 157 |
-
|
| 158 |
-
**Example 2: Recreate a webpage from an image**
|
| 159 |
-
Image 1: Original Webpage
|
| 160 |
-
<center><img src="images2/ori.png" alt="k2 eval table" /></center>
|
| 161 |
-
|
| 162 |
-
Image 2: CrystalChat-Web2Code Rendering
|
| 163 |
-
<center><img src="images2/crystalchat.png" alt="k2 eval table" /></center>
|
| 164 |
-
|
| 165 |
-
|
| 166 |
-
**Image 3:** Hand-drawn webpage input to CrystalChat-7B-Web2Code generated output.
|
| 167 |
-
|
| 168 |
## Loading Crystal
|
| 169 |
```python
|
| 170 |
from transformers import AutoModelForCausalLM, AutoTokenizer
|
|
|
|
| 19 |
CrystalChat-7B based multi-modal large language model (MLLM) mimics the training recipe used for Vicuna-7B based [LLaVa-v1.5](https://huggingface.co/docs/transformers/main/model_doc/llava). CrystalChat-7B based MLLMs models are entirely transparent, having open-sourced all materials, including code, data, model checkpoint, intermediate results, and more at [Web2Code: A Large-scale Webpage-to-Code Dataset
|
| 20 |
and Evaluation Framework for Multimodal LLMs](https://arxiv.org/pdf/2406.20098). CrystalChat-7B-Web2Code MLLM is specialized in webpage images-to-html code generation.
|
| 21 |
|
| 22 |
+
|
| 23 |
+
## CrystalChat-Web2Code Features
|
| 24 |
+
|
| 25 |
+
**Covert hand-drawn images to a website**
|
| 26 |
+
|
| 27 |
+
|  |  |
|
| 28 |
+
|:----------------------:|:----------------------:|
|
| 29 |
+
| Hand Drawn Webpage | CrystalChat-Web2Code Rendering |
|
| 30 |
+
|
| 31 |
+
|
| 32 |
+
**Recreate a new webpage from an existing webpage**
|
| 33 |
+
Image 1: Original Webpage
|
| 34 |
+
<center><img src="images2/ori.png" alt="k2 eval table" /></center>
|
| 35 |
+
|
| 36 |
+
Image 2: CrystalChat-Web2Code Rendering
|
| 37 |
+
<center><img src="images2/crystalchat.png" alt="k2 eval table" /></center>
|
| 38 |
+
|
| 39 |
+
|
| 40 |
## Web2Code Dataset
|
| 41 |
Our Web2Code instruction tuning dataset construction and instruction generation process
|
| 42 |
involves four key components:
|
|
|
|
| 160 |
|
| 161 |
**Table 6:** Distribution of DWU and DWU<sub>R</sub> datasets. Both datasets include high-quality question-answer pairs for webpage understanding.*
|
| 162 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 163 |
## Loading Crystal
|
| 164 |
```python
|
| 165 |
from transformers import AutoModelForCausalLM, AutoTokenizer
|