文本到图像

Stable Diffusion V3 API Text2Image API 根据文本提示生成图像。

该接口根据请求正文中传递的文本生成并返回图像。

多语言支持语言

如果您在文本提示中使用除英语以外的语言,请在请求体中传递参数"multi_lingual",并将其值设置为"yes"。这将在处理您的请求时触发自动语言检测和翻译。

支持以下语言：

阿拉伯语(ar_AR), 捷克语(cs_CZ), 德语(de_DE), 英语(en_XX), 西班牙语(es_XX), 爱沙尼亚语(et_EE), 芬兰语(fi_FI), 法语(fr_XX), 古吉拉特语(gu_IN), 印地语(hi_IN), 意大利语(it_IT), 日语(ja_XX), 哈萨克语(kk_KZ), 韩语(ko_KR), 立陶宛语(lt_LT), 拉脱维亚语(lv_LV), 缅甸语(my_MM), 尼泊尔语(ne_NP), 荷兰语(nl_XX), 罗马尼亚语(ro_RO), 俄语(ru_RU), 僧伽罗语(si_LK), 土耳其语(tr_TR), 越南语(vi_VN), 中文(zh_CN), 南非语(af_ZA), 阿塞拜疆语(az_AZ), 孟加拉语(bn_IN), 波斯语(fa_IR), 希伯来语(he_IL), 克罗地亚语(hr_HR), 印尼语(id_ID), 格鲁吉亚语(ka_GE), 高棉语(km_KH), 马其顿语(mk_MK), 马拉雅拉姆语(ml_IN), 蒙古语(mn_MN), 马拉地语(mr_IN), 波兰语(pl_PL), 普什图语(ps_AF), 葡萄牙语(pt_XX), 瑞典语(sv_SE), 斯瓦希里语(sw_KE), 泰米尔语(ta_IN), 泰卢固语(te_IN), 泰语(th_TH), 他加禄语(tl_XX), 乌克兰语(uk_UA), 乌尔都语(ur_PK), 科萨语(xh_ZA), 加利西亚语(gl_ES), 斯洛文尼亚语(sl_SI)

curl --location --request POST 'https://stablediffusionapi.com/api/v3/text2img' \ --header 'Content-Type: application/json' \ --data-raw '{ "key": "", "prompt": "ultra realistic close up portrait ((beautiful pale cyberpunk female with heavy black eyeliner))", "negative_prompt": null, "width": "512", "height": "512", "samples": "1", "num_inference_steps": "20", "safety_checker": "no", "enhance_prompt": "yes", "seed": null, "guidance_scale": 7.5, "multi_lingual": "no", "panorama": "no", "self_attention": "no", "upscale": "no", "embeddings_model": null, "webhook": null, "track_id": null }'

{ "status": "success", "generationTime": 1.3200268745422363, "id": 12202888, "output": [ "https://pub-8b49af329fae499aa563997f5d4068a4.r2.dev/generations/e5cd86d3-7305-47fc-82c1-7d1a3b130fa4-0.png" ], "meta": { "H": 512, "W": 512, "enable_attention_slicing": "true", "file_prefix": "e5cd86d3-7305-47fc-82c1-7d1a3b130fa4", "guidance_scale": 7.5, "model": "runwayml/stable-diffusion-v1-5", "n_samples": 1, "negative_prompt": " ((out of frame)), ((extra fingers)), mutated hands, ((poorly drawn hands)), ((poorly drawn face)), (((mutation))), (((deformed))), (((tiling))), ((naked)), ((tile)), ((fleshpile)), ((ugly)), (((abstract))), blurry, ((bad anatomy)), ((bad proportions)), ((extra limbs)), cloned face, glitchy, ((extra breasts)), ((double torso)), ((extra arms)), ((extra hands)), ((mangled fingers)), ((missing breasts)), (missing lips), ((ugly face)), ((fat)), ((extra legs))", "outdir": "out", "prompt": "ultra realistic close up portrait ((beautiful pale cyberpunk female with heavy black eyeliner)) DSLR photography, sharp focus, Unreal Engine 5, Octane Render, Redshift, ((cinematic lighting)), f/1.4, ISO 200, 1/160s, 8K, RAW, unedited, symmetrical balance, in-frame", "revision": "fp16", "safetychecker": "no", "seed": 3499575229, "steps": 20, "vae": "stabilityai/sd-vae-ft-mse" } }

请求参数

Body 参数application/json

key

string

必需

您的 API 密钥用于请求授权。

prompt

string

必需

文本提示，包含您想要在图像中生成的内容的描述。

negative_prompt

null

必需

图像中您不想要的项目。

width

string

必需

最大高度：宽度：1024x1024。

height

string

必需

最大高度：宽度：1024x1024。

samples

string

必需

作为响应返回的图像数量。最大值为 4。

num_inference_steps

string

必需

去噪步骤数。可用值：21、31、41、51。

safety_checker

string

必需

NSFW 图像检查器。如果检测到此类图像，它将被空白图像替换。

enhance_prompt

string

必需

增强提示以获得更好的结果；默认值：是，选项：是/否。

seed

null

必需

种子用于重现结果，相同的种子将再次为您提供相同的图像。为随机数传递null 。

guidance_scale

number

必需

无分类器指导的量表（最小值：1；最大值：20）。

multi_lingual

string

必需

允许多语言提示生成图像。使用“no”作为默认英语。

panorama

string

必需

将此参数设置为“yes”以生成全景图像。

self_attention

string

必需

如果您想要高质量的图像，请将此参数设置为“yes”。在这种情况下，图像生成将花费更多时间。

upscale

string

必需

如果您想要将给定图像分辨率放大两倍 (2x)，请将此参数设置为“yes”。如果请求的分辨率为 512 x 512 像素，则生成的图像将为 1024 x 1024 像素。

embeddings_model

null

必需

这用于传递嵌入模型（embeddings_model_id）。

webhook

null

必需

设置 URL 以在图像生成完成后获取 POST API 调用。

track_id

null

必需

此 ID 在 Webhook API 调用的响应中返回。这将用于识别 Webhook 请求。

示例

返回响应

🟢200成功

application/json

Body

status

string

必需

generationTime

number

必需

integer

必需

output

array[string]

必需

多语言支持语言#

请求参数

返回响应

多语言支持语言