First of all, congratulations on such an awesome and complete image editing survey, and special thanks for including our paper, Forgedit: text guided image editing via learning and forgetting, in the survey. I am the first author of this paper, and I think there might be some misunderstandings of our method in Table 1 of this survey and mistakes in the editing results presented in Figure 13.
First, Forgedit was designed to tackle general text-guided image editing, so Forgedit is in fact capable of performing most of the tasks in Table 1, as I show in the following examples.
![image](https://private-user-images.githubusercontent.com/144800993/320217166-2fe219e8-fc11-469c-8c8a-4006c710861c.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2MTM5MjAsIm5iZiI6MTczOTYxMzYyMCwicGF0aCI6Ii8xNDQ4MDA5OTMvMzIwMjE3MTY2LTJmZTIxOWU4LWZjMTEtNDY5Yy04YzhhLTQwMDZjNzEwODYxYy5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjE1JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxNVQxMDAwMjBaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT00ZGU0NzNhMDI1OTBiYTAwNDhjNzNmODkwYmVhMjdkZjJlM2M3MTMyZGZkMGJjY2I5NmYzMWIzMjZkOWVhYjhkJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.z2KXlMje3HGqzQS-B19Lg8kHF-4Y9lL_Sv5nih4K_7M)
Second, the Forgedit editing results in Figure 13 are incorrect. I recently refined the Forgedit code to make our results easier to reproduce. Below I show results for all editing examples from editeval-v1 that appear in Figure 13 of your paper, together with the hyperparameters needed to reproduce them. For a fair comparison, all results are obtained with Stable Diffusion 1.4. The success rate and editing quality could be improved by switching the base model to the Realistic Vision series. Only the target prompt and input image from editeval-v1 are used, since Forgedit can use BLIP to generate the source prompt.
![image](https://private-user-images.githubusercontent.com/144800993/320217372-2e5c75fb-9b59-485d-b6be-0d03795420c2.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2MTM5MjAsIm5iZiI6MTczOTYxMzYyMCwicGF0aCI6Ii8xNDQ4MDA5OTMvMzIwMjE3MzcyLTJlNWM3NWZiLTliNTktNDg1ZC1iNmJlLTBkMDM3OTU0MjBjMi5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjE1JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxNVQxMDAwMjBaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT02NzVkMTkzODgwMmNiNzc0NmU1ZjM1MTk5MGVkM2U3YTFjZjgyNjA4NTk3M2RlMGY3NzVhN2ViMzE1NjYxZTczJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.huRC6GuLLQDNU7xhPN6MaRQrDWz_ebcHPwnXFLS9fgc)
![2](https://private-user-images.githubusercontent.com/144800993/320219721-906b940e-fba7-4757-b113-bbd7085e2e70.jpg?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2MTM5MjAsIm5iZiI6MTczOTYxMzYyMCwicGF0aCI6Ii8xNDQ4MDA5OTMvMzIwMjE5NzIxLTkwNmI5NDBlLWZiYTctNDc1Ny1iMTEzLWJiZDcwODVlMmU3MC5qcGc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjE1JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxNVQxMDAwMjBaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0zZWZlZGJhY2IyOWJiNTJkOGQ4ZjViYTEzNzZkOTA4NjM3ZWRkM2NmNzkwMjA1NzliYjgxODgzZmVhMjE3MmQxJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.6SJTM4YKcKBEm4U5YPqsbjn3396MjBor9NH5ENoy5nE)
Editing type: action change
Input image, target prompt='A polar bear raising its hand'
forgedit command:
![3_encoderattn_A polar bear raising its hand_guidance_scale=7 5__textalpha=0 0_alpha=1 2000000000000002_](https://private-user-images.githubusercontent.com/144800993/320219668-e57067e4-f147-46bb-a7f1-f1e2e9589eb2.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2MTM5MjAsIm5iZiI6MTczOTYxMzYyMCwicGF0aCI6Ii8xNDQ4MDA5OTMvMzIwMjE5NjY4LWU1NzA2N2U0LWYxNDctNDZiYi1hN2YxLWYxZTJlOTU4OWViMi5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjE1JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxNVQxMDAwMjBaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0xZjJkMmU1ZTNhNmIyOTcwNDNiYjNmMzc0ZmI3NmFiNGE0OGM2MjYwN2ExNWRkYTZlOTVhMWFhMzMyNjRlYjQzJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.vnt_tps1IUbiXVV5xmhkH31oLy8h4J9HrQeDi8GNvEo)
![7_encoderattn_A polar bear raising its hand_guidance_scale=7 5__textalpha=0 0_alpha=1 2000000000000002_2 jpg](https://private-user-images.githubusercontent.com/144800993/320220062-261e5007-7de0-49aa-965a-c74525606d07.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2MTM5MjAsIm5iZiI6MTczOTYxMzYyMCwicGF0aCI6Ii8xNDQ4MDA5OTMvMzIwMjIwMDYyLTI2MWU1MDA3LTdkZTAtNDlhYS05NjVhLWM3NDUyNTYwNmQwNy5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjE1JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxNVQxMDAwMjBaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT03ZDIwZDliNTQwYWQyMjQ2YTFlYWIxMWYwOGUyYjdjZTJlNTllZTcyZGExMjE2NWM4MmJiMDdkZDUyMzNmNTUyJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.vS4UjSXW_bR84gOu27EgYiDSkzogcbVyWhHj4qkJxDQ)
![8_encoderattn_A polar bear raising its hand_guidance_scale=7 5__textalpha=0 0_alpha=1 2000000000000002_2 jpg](https://private-user-images.githubusercontent.com/144800993/320220224-60f67183-03a5-4cd5-a65e-b0a923c9e230.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2MTM5MjAsIm5iZiI6MTczOTYxMzYyMCwicGF0aCI6Ii8xNDQ4MDA5OTMvMzIwMjIwMjI0LTYwZjY3MTgzLTAzYTUtNGNkNS1hNjVlLWIwYTkyM2M5ZTIzMC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjE1JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxNVQxMDAwMjBaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT02MDY4NDA3NTAwNmRmMjczODM1MzYwOTEzZTExZGExNGJhNzQxNjFjMzQxNDc1Zjk1MjM4ZThmYmFhMDQxMzBlJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.hxrwDMsuHUugJcawJLM1FZdKw6toEtU8FPyeemc-cg4)
![6_encoderattn_A polar bear is raising its hand_guidance_scale=7 5__textalpha=0 0_alpha=1 4000000000000001_2 jpg](https://private-user-images.githubusercontent.com/144800993/320218881-90144164-9cdb-4a6d-8788-6a70d572fea5.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2MTM5MjAsIm5iZiI6MTczOTYxMzYyMCwicGF0aCI6Ii8xNDQ4MDA5OTMvMzIwMjE4ODgxLTkwMTQ0MTY0LTljZGItNGE2ZC04Nzg4LTZhNzBkNTcyZmVhNS5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjE1JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxNVQxMDAwMjBaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1kZWFjMDFjYmM4ZDM4NWZiM2NkZjFhODdlYjVlYjhmNDhjZTczNjMwNTNlZGNjMGIwYjRiOWE5YWE5ODU4MWEyJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.Cs5fOQvxtrRK6PXi8cupNjox4-8lKkcL3gwSi4-3Fvs)
accelerate launch src/sample_forgedit_batch_textencoder.py --train=True --edit=True --save=True --forget='encoderattn' --interpolation=vs --gammastart=11 --gammaend=15 --numtest=7
Editing type: object addition
![7](https://private-user-images.githubusercontent.com/144800993/320220568-420d0823-8b73-4bec-9f0e-39b60dcb08f6.jpg?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2MTM5MjAsIm5iZiI6MTczOTYxMzYyMCwicGF0aCI6Ii8xNDQ4MDA5OTMvMzIwMjIwNTY4LTQyMGQwODIzLThiNzMtNGJlYy05ZjBlLTM5YjYwZGNiMDhmNi5qcGc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjE1JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxNVQxMDAwMjBaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0yOWJhYTg2ZmY3NzhiNTE0Mjc2YWQ0N2RhZmU3ZDQ5ZjQ2MGJiOTA4ZDU3YzJiNDUxMTJmZDdlZDZlODM0Yjg0JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.sUQyVSAkSDpuuajzOtDNOR_zvSjjhLikvVKXnVEi_yc)
![3_orig_A glass of milk next to a stack of cookies on a wooden board with a gray background_guidance_scale=7 5__textalpha=0 0_alpha=1 3_7 jpg](https://private-user-images.githubusercontent.com/144800993/320220704-e49a0ae4-7ac1-4668-8a4c-5b38b788d5fb.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2MTM5MjAsIm5iZiI6MTczOTYxMzYyMCwicGF0aCI6Ii8xNDQ4MDA5OTMvMzIwMjIwNzA0LWU0OWEwYWU0LTdhYzEtNDY2OC04YTRjLTViMzhiNzg4ZDVmYi5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjE1JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxNVQxMDAwMjBaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT04ZDQ4MDg0MDBiODY2OGE3YjJhMGVmMmU0MDhjZjIzNDEwMzdjZjQ4M2I1NjBlNWVkNTMxOGU5OWMwMTEwNTA4JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.oD5B6AgvxXd1fr68twpt7ShP-ko9GJnvHjpFOQfFw_M)
Input image, target prompt='A glass of milk next to a stack of cookies on a wooden board with a gray background'
forgedit command:
accelerate launch src/sample_forgedit_batch_textencoder.py --train=True --edit=True --save=True --forget='donotforget' --interpolation=vs --gammastart=13 --gammaend=15 --numtest=7
Editing type: object removal
![rm7](https://private-user-images.githubusercontent.com/144800993/320221258-20806bfd-e6de-4454-8a42-03264b2c3b47.jpg?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2MTM5MjAsIm5iZiI6MTczOTYxMzYyMCwicGF0aCI6Ii8xNDQ4MDA5OTMvMzIwMjIxMjU4LTIwODA2YmZkLWU2ZGUtNDQ1NC04YTQyLTAzMjY0YjJjM2I0Ny5qcGc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjE1JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxNVQxMDAwMjBaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0zMDI5Mzk2MTk4NmEzNWE1M2Q1MDJlM2ZjYzlmZGYyMTZhZmFiNzY2NTNiMWUwMTU4ZTA5MjhlNDRhYWEzMzYxJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.C18LjtWEUP8p1fr0A-HeFNfivMiGwl4VvuIFEBKXQok)
![image](https://private-user-images.githubusercontent.com/144800993/320221652-6eec7466-9549-487b-94ae-3800dc7ab039.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2MTM5MjAsIm5iZiI6MTczOTYxMzYyMCwicGF0aCI6Ii8xNDQ4MDA5OTMvMzIwMjIxNjUyLTZlZWM3NDY2LTk1NDktNDg3Yi05NGFlLTM4MDBkYzdhYjAzOS5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjE1JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxNVQxMDAwMjBaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0wMDY4Yzc2OGQ3YTdkMGVmZTc4OTIwNGQ5YjM3NTUzMTIwN2YyNWZkMTZkZjI5YzRmYTY4N2FhMzRlMDc3NmNjJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.PzYKTKccLc5YX3avmXsYMbTwuH5RiXa3tqrpoRJAeI8)
Input image, target prompt='A mountain lake'
forgedit command:
accelerate launch src/sample_forgedit_batch_textencoder.py --train=True --edit=True --save=True --forget='donotforget' --interpolation=vs --gammastart=15 --gammaend=18 --numtest=4
Editing type: object replacement
![1](https://private-user-images.githubusercontent.com/144800993/320221772-633ca0fc-3fa9-4ff9-b84e-039434621259.jpg?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2MTM5MjAsIm5iZiI6MTczOTYxMzYyMCwicGF0aCI6Ii8xNDQ4MDA5OTMvMzIwMjIxNzcyLTYzM2NhMGZjLTNmYTktNGZmOS1iODRlLTAzOTQzNDYyMTI1OS5qcGc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjE1JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxNVQxMDAwMjBaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1kOWYwNDdhNjQzNDEyYjdlYzRiOTQxOTY4Y2Y1MzkwN2Y4ZTUxMzIzOTI1ZjdkZDVhMGI2M2U1NDM2NjI1MzMxJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.h1wWCgfunwMeIp5iBeSnFmy-OXW8q_LMhmOE_nGrvb4)
![5_donotforget_A floor lamp standing next to a potted plant in a cozy room_guidance_scale=7 5__textalpha=0 0_alpha=1 1_1 jpg](https://private-user-images.githubusercontent.com/144800993/320221830-0d2925c6-dc44-4f8a-97ee-37f8e8b347fd.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2MTM5MjAsIm5iZiI6MTczOTYxMzYyMCwicGF0aCI6Ii8xNDQ4MDA5OTMvMzIwMjIxODMwLTBkMjkyNWM2LWRjNDQtNGY4YS05N2VlLTM3ZjhlOGIzNDdmZC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjE1JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxNVQxMDAwMjBaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1kMTdhODY1ZWU4NzIzNjY0YWQzYjNiNGJkZmM4MzU1Nzk5Y2E3OTA4YjZkMDMzMmFmNGZjNWJlYjVkODk2ZDA2JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.4KsHxY03fuHW95aIs1adn4CHCDKJC4OeRUKilgTk7tU)
Input image, target prompt='A floor lamp standing next to a potted plant in a cozy room'
forgedit command:
accelerate launch src/sample_forgedit_batch_textencoder.py --train=True --edit=True --save=True --forget='donotforget' --interpolation=vs --gammastart=11 --gammaend=15 --numtest=7
Editing type: background change
![8](https://private-user-images.githubusercontent.com/144800993/320222016-c3192b43-4023-47b2-81be-fa04079b3a43.jpg?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2MTM5MjAsIm5iZiI6MTczOTYxMzYyMCwicGF0aCI6Ii8xNDQ4MDA5OTMvMzIwMjIyMDE2LWMzMTkyYjQzLTQwMjMtNDdiMi04MWJlLWZhMDQwNzliM2E0My5qcGc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjE1JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxNVQxMDAwMjBaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0yNWMxMTMwZGM5MmMzYzViNGRiOGFhMmJkYWZhNjA3MDlhNjNhOTk1ZTBmMmJiOWEzMDUxMmYxNDEwMjc2NDUzJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.5k7BCcSCEZhB2Igwv_-Maq1psLKSsFpOB3xSaKQaNqE)
![0_encoderattn+encoder1_A silver car parked at a dense jungle_guidance_scale=7 5__textalpha=0 0_alpha=1 4000000000000001_8 jpg](https://private-user-images.githubusercontent.com/144800993/320222089-34f289c9-2194-45ac-a51d-d3e2089cc091.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2MTM5MjAsIm5iZiI6MTczOTYxMzYyMCwicGF0aCI6Ii8xNDQ4MDA5OTMvMzIwMjIyMDg5LTM0ZjI4OWM5LTIxOTQtNDVhYy1hNTFkLWQzZTIwODljYzA5MS5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjE1JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxNVQxMDAwMjBaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT01NWMxYWQzZDU1ODE0YmE2M2VlOGU3MDc4ZjA0ZDU1ZmE1NjAwNTJmOTcwZDUxMWZlZTk1Mzg2MDk0NDBmZGNkJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.jUmB2HNKRJk6oGuYDk3Ui_GyBLr2RezPPm7EnaHJbMU)
Input image, target prompt='A silver car parked at a dense jungle'
forgedit command:
accelerate launch src/sample_forgedit_batch_textencoder.py --train=True --edit=True --save=True --forget='encoderattn+encoder1' --interpolation=vs --gammastart=12 --gammaend=15 --numtest=4
Editing type: style change
![style2](https://private-user-images.githubusercontent.com/144800993/320222227-86e164f9-4f06-4685-8815-b93b2e12d267.jpg?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2MTM5MjAsIm5iZiI6MTczOTYxMzYyMCwicGF0aCI6Ii8xNDQ4MDA5OTMvMzIwMjIyMjI3LTg2ZTE2NGY5LTRmMDYtNDY4NS04ODE1LWI5M2IyZTEyZDI2Ny5qcGc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjE1JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxNVQxMDAwMjBaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1lM2ZiZGJiOWEyMDllNmFjOWUyYmZkMGE1ODIzNTE4YzNkZmUxM2Q4ZGQ2ZDgwMWZmOWQ5NmRmOTEwZjY5NmVmJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.jEVtzAoq71LN1tAduvwqDFz7GDK1bWfoH4z31TknJr0)
![2_donotforget_A Van Gogh style painting of a light house sitting on a cliff next to the ocean_guidance_scale=7 5__textalpha=0 0_alpha=1 3_style2 jpg](https://private-user-images.githubusercontent.com/144800993/320222395-cc505262-384b-47f0-aad6-0c8d9884e584.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2MTM5MjAsIm5iZiI6MTczOTYxMzYyMCwicGF0aCI6Ii8xNDQ4MDA5OTMvMzIwMjIyMzk1LWNjNTA1MjYyLTM4NGItNDdmMC1hYWQ2LTBjOGQ5ODg0ZTU4NC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjE1JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxNVQxMDAwMjBaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1mMDlmNzdkMWVkMzZlM2JiMTVkZjU3ZTkxNGY1ZDU4YTZlYWQ5MDNlMzdkNjNhYWFhODBkZTZkYTFkMTg4ZTBjJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.fx6jdKM3332jE4-Y0WS4GmKSrt2y6YAo4umxfbSVg8Q)
Input image, target prompt='A Van Gogh style painting of a light house sitting on a cliff next to the ocean'
forgedit command:
accelerate launch src/sample_forgedit_batch_textencoder.py --train=True --edit=True --save=True --forget='donotforget' --interpolation=vs --gammastart=13 --gammaend=15 --numtest=7
Editing type: texture change
![texture2](https://private-user-images.githubusercontent.com/144800993/320222517-17e1b3c3-4bb6-48e0-a184-bf0f7621943e.jpg?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2MTM5MjAsIm5iZiI6MTczOTYxMzYyMCwicGF0aCI6Ii8xNDQ4MDA5OTMvMzIwMjIyNTE3LTE3ZTFiM2MzLTRiYjYtNDhlMC1hMTg0LWJmMGY3NjIxOTQzZS5qcGc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjE1JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxNVQxMDAwMjBaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1iMjZmNmE5NjFkMzg5ZmMxZjlmMjlmOWViNTQ0YWUxMmRlZTRmMjgwOTliYmIwMjgxYTM1YzEzYjg4N2MxNWIyJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9._2r9nm1f7ZU9yePEGTXbri__EmXR9Xbl2M29ym1Qb_c)
![2_decoderattn+decoder2_A statue of a horse running in a field_guidance_scale=7 5__textalpha=0 0_alpha=1 5_texture2 jpg](https://private-user-images.githubusercontent.com/144800993/320222733-52376d43-a38b-4b01-a03b-415ccdf1bae6.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2MTM5MjAsIm5iZiI6MTczOTYxMzYyMCwicGF0aCI6Ii8xNDQ4MDA5OTMvMzIwMjIyNzMzLTUyMzc2ZDQzLWEzOGItNGIwMS1hMDNiLTQxNWNjZGYxYmFlNi5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjE1JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxNVQxMDAwMjBaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0wYTY0MTcwZGI5MGFkMzY5MGJkYWJlMmQwYTdhMmNiN2E2ZThiNWI4ZjA5Y2JjMWM2NzgxNDE4OTM2NDA4NWRmJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.CYSCpD5dVYEA4szVbWU4ScdkUHHQkNu72Ahy11jWtY8)
Input image, target prompt='A statue of a horse running in a field'
forgedit command:
accelerate launch src/sample_forgedit_batch_textencoder.py --train=True --edit=True --save=True --forget='decoderattn+decoder2' --interpolation=vs --gammastart=13 --gammaend=17 --numtest=4
Forgedit can also handle emotion expression editing.
![test](https://private-user-images.githubusercontent.com/144800993/320223268-2ac8a6ba-9341-419b-82b4-8f9edcf5192c.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2MTM5MjAsIm5iZiI6MTczOTYxMzYyMCwicGF0aCI6Ii8xNDQ4MDA5OTMvMzIwMjIzMjY4LTJhYzhhNmJhLTkzNDEtNDE5Yi04MmI0LThmOWVkY2Y1MTkyYy5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjE1JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxNVQxMDAwMjBaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1iYTJlZjFkY2JiZDY1OGU5OWNjNThmMDY0YWM5NDhhNDVjMGM3NTQ4YThkMDIyZDQ3MzhmODJlZjIzYTUwZTAyJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.gZfzSqXqXiIeeozcCIf6slHCIj5mjc-V3jdJrfHTtUY)
![image](https://private-user-images.githubusercontent.com/144800993/320223377-e0bffefe-c9f8-44b4-9807-d043efa574d1.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2MTM5MjAsIm5iZiI6MTczOTYxMzYyMCwicGF0aCI6Ii8xNDQ4MDA5OTMvMzIwMjIzMzc3LWUwYmZmZWZlLWM5ZjgtNDRiNC05ODA3LWQwNDNlZmE1NzRkMS5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjE1JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxNVQxMDAwMjBaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0zOTcwZmFjZjY1YmMzOTE2N2I3ZDI4MTYzMGE1MTFmMWI3MjY1NDdiMDM1ZjJhMzYxNjhmYzNlYWYzOWI4NDE4JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.pT_UFwvtPLHxc1FBC-xG7OESQOzAP1Iz4gxZxCTQLmo)
![image](https://private-user-images.githubusercontent.com/144800993/320224210-70973ca4-d890-4e50-806c-40c0a4a59edc.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2MTM5MjAsIm5iZiI6MTczOTYxMzYyMCwicGF0aCI6Ii8xNDQ4MDA5OTMvMzIwMjI0MjEwLTcwOTczY2E0LWQ4OTAtNGU1MC04MDZjLTQwYzBhNGE1OWVkYy5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjE1JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxNVQxMDAwMjBaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0yYzU5ZmQxYTU4MTQxNmI4N2RmNzUxMDE1OTI2YjQ5MzVjMTE3ZGU4ZmUyZTRiOTZlYWY1NTQ4YmY4NTlmYmFjJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.8JBSVo1v3BMkA9AXQuMvA972WhmWFqkXJP7mRpV26Uw)
Input image, target prompt='a smiling man and a smiling woman'
Here I switched to Realistic Vision for human editing, though I think Stable Diffusion 1.4 should work too.
forgedit command:
accelerate launch src/sample_forgedit_batch_textencoder.py --train=True --edit=True --save=True --forget='donotforget' --interpolation=vs --targeth=768 --targetw=768 --gammastart=8 --gammaend=11
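To make batch reproduction a bit easier, the settings listed above can be gathered into one small Python script that prints each command. This is only a rough sketch: the `SETTINGS` dictionary and the `build_command` helper are illustrative names and not part of the Forgedit repository, while the script path and flag values are copied from the commands in this issue. The commands above do not show how the input image, the target prompt, or the base model are selected, so the sketch leaves those out.

```python
import shlex

# Script path exactly as it appears in the commands in this issue.
SCRIPT = "src/sample_forgedit_batch_textencoder.py"

# Per-editing-type flag values copied from this issue.
# The dictionary layout itself is a hypothetical convenience.
SETTINGS = {
    "action change": {"forget": "encoderattn", "gammastart": 11, "gammaend": 15, "numtest": 7},
    "object addition": {"forget": "donotforget", "gammastart": 13, "gammaend": 15, "numtest": 7},
    "object removal": {"forget": "donotforget", "gammastart": 15, "gammaend": 18, "numtest": 4},
    "object replacement": {"forget": "donotforget", "gammastart": 11, "gammaend": 15, "numtest": 7},
    "background change": {"forget": "encoderattn+encoder1", "gammastart": 12, "gammaend": 15, "numtest": 4},
    "style change": {"forget": "donotforget", "gammastart": 13, "gammaend": 15, "numtest": 7},
    "texture change": {"forget": "decoderattn+decoder2", "gammastart": 13, "gammaend": 17, "numtest": 4},
    # The emotion example was run with Realistic Vision; how the base model
    # is switched is not shown in the command, so it is not modeled here.
    "emotion expression": {"forget": "donotforget", "targeth": 768, "targetw": 768,
                           "gammastart": 8, "gammaend": 11},
}


def build_command(edit_type):
    """Return the argument list for one editing type (hypothetical helper)."""
    options = dict(SETTINGS[edit_type])
    forget = options.pop("forget")
    argv = ["accelerate", "launch", SCRIPT,
            "--train=True", "--edit=True", "--save=True",
            f"--forget={forget}", "--interpolation=vs"]
    # Remaining flags keep the order in which they were listed above.
    argv += [f"--{name}={value}" for name, value in options.items()]
    return argv


if __name__ == "__main__":
    for edit_type in SETTINGS:
        print(f"# {edit_type}")
        print(shlex.join(build_command(edit_type)))
```

Printing the commands instead of running them keeps the sketch free of side effects; each printed line can be pasted into a shell, or the list from `build_command` can be passed to `subprocess.run` inside an environment where Forgedit is set up.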
For object movement and object size change, there are multiple cases in TEdBench, another text-guided image editing benchmark from Google. Forgedit can handle these cases too. The results can be found in Forgedit TEdBench.
Finally, if you have any difficulty reproducing Forgedit's results on editeval, feel free to leave a comment or contact me via email. It would be great if the Forgedit editing results could be corrected in the next version of this survey. Thanks again.