risk-ai-workshop/exercises/4-future-ai-risk.md at 557ed56dcacdef0c551df589c495f704e5094671 · munichpavel/risk-ai-workshop

Future of AI Risk Management Exercises

Jailbreak ingredients via language embeddings (*)

Pick at least 2 additional topics as in the example notebook ../notebooks/language-embeddings.ipynb, and for each topic come up with 3-5 sentences, most of which are similar in meaning but expressed differently.

If you prefer R, you can access the same embedding models from HuggingFace using the package r-text.

Which variant is most likely to increase the success of a jailbreak attempt?

Try to jailbreak an LLM (**)

Try to get a public LLM to give you definitive financial advice. Screenshots of your attempts (ideally success) is sufficient evidence.

Super-intelligence (AI 2027) vs "AI-as-Normal-Technology" (*)

Which of the two future AI scenarios do you find more compelling? Give at least two reasons or examples for your claim.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Future of AI Risk Management Exercises

Jailbreak ingredients via language embeddings (*)

Try to jailbreak an LLM (**)

Super-intelligence (AI 2027) vs "AI-as-Normal-Technology" (*)

FilesExpand file tree

4-future-ai-risk.md

Latest commit

History

4-future-ai-risk.md

File metadata and controls

Future of AI Risk Management Exercises

Jailbreak ingredients via language embeddings (*)

Try to jailbreak an LLM (**)

Super-intelligence (AI 2027) vs "AI-as-Normal-Technology" (*)