I would say everybody smart is doing that, but a lot of the dumb money in AI right now is going into wrappers on the GPT API. That makes for a flashy demo with no underlying substance or expertise.
They are 100% better for classification at a given compute budget. In token classification, for example, they can take into account the information both before and after a token and use it to classify that token.
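To make the "before and after" point concrete, here is a rough sketch, assuming "they" means models that attend in both directions, compared against a left-to-right (causal) model. The helper names `attention_mask` and `visible_context` and the example sentence are made up for illustration; this is a toy mask, not any library's API.

```python
# Toy illustration: which tokens can inform the label of one target token.
# Assumption: "bidirectional" means every position may attend to every other
# position; "causal" means a position only sees itself and earlier positions.

def attention_mask(seq_len, bidirectional):
    """mask[i][j] is True if position i is allowed to attend to position j."""
    return [[bidirectional or j <= i for j in range(seq_len)]
            for i in range(seq_len)]

def visible_context(tokens, index, bidirectional):
    """Return the tokens that position `index` can see under the given mask."""
    mask = attention_mask(len(tokens), bidirectional)
    return [tok for j, tok in enumerate(tokens) if mask[index][j]]

# Hypothetical example: is "Washington" a person, a place, or an organization?
tokens = ["He", "went", "to", "Washington", "State", "University"]
i = tokens.index("Washington")

causal_view = visible_context(tokens, i, bidirectional=False)
full_view = visible_context(tokens, i, bidirectional=True)

print("causal:       ", causal_view)
print("bidirectional:", full_view)
```

In the causal view the words "State University" are not available when labeling "Washington", which is exactly the context that settles the question.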