What type of tags are you reading? If you're reading AOI parameters, you won't get much performance out of it. And if you're reading only UDT members, but any member of those UDTs isn't set to at least Read Only external access, you won't get good performance either.
Rockwell have deliberately shot themselves in the foot by not publicly releasing the documentation for efficient access to AOI parameters. Their own software obviously utilises methods that make comms to AOI parameters efficient, which is most likely why you're seeing the difference.
If you can't use UDTs in the PLC and you're not using PlantPAx v5 (comms to v5 will always be terrible), then you can also use Phil's Automation Professionals' EtherNet/IP Communication Suite V2 to read AOI params efficiently, as he has reverse-engineered the protocol.
Reading only UDT members, I've tested reading 8000 tags from an L82 @ 10ms. Make sure to increase the CIP connection size as well, up to 4000 depending on the controller.
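
As a rough back-of-envelope for why the connection size matters, here's a small Python sketch. The numbers are assumptions for illustration, not measurements: I'm pretending every tag is a 4-byte DINT, ignoring per-packet protocol overhead by default, and comparing a hypothetical smaller connection size of 500 against 4000. The function name `packets_needed` is just something I made up for this example.

```python
import math


def packets_needed(tag_count, bytes_per_tag, connection_size, overhead_bytes=0):
    """Estimate how many response packets it takes to return all tag values.

    overhead_bytes is a placeholder for per-packet protocol overhead;
    the real figure depends on the request type and is not modelled here.
    """
    usable = connection_size - overhead_bytes
    if usable <= 0:
        raise ValueError("connection size must be larger than the overhead")
    total_bytes = tag_count * bytes_per_tag
    return math.ceil(total_bytes / usable)


# Assumed figures: 8000 values at 4 bytes each (DINT-sized).
for size in (500, 4000):
    print(f"connection size {size}: ~{packets_needed(8000, 4, size)} packets")
```

Under those assumptions the larger connection size needs roughly an eighth of the packets for the same data, which is the general idea behind bumping it up. Real-world numbers will differ once overhead and mixed data types come into it.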