There also seem to be drawbacks:
• The model fails on some relatively simple tasks that are also known to trip up other large language models, such as counting the number of US state names that contain ...