I especially like #1 and #3,
When I started to view #1, I saw an idol on my monitor and thought "Ho-Hum" just another image of a statue but, as I scrolled down, the people on the street came into view and that changed the entire look of the image. The people became a reference point showing how large is the statue and, IMO, really make the image.
The people in image #3 also are references to the size of the building and the area you photographed. They also add color to the image.
I would warm up the images a TAD and perhaps add some contrast and maybe a bit of saturation. That might compensate for the original under exposure. I selected the sky and added some structure which brings out the clouds, I also selected both groups of people in the foreground and brightened then and added a bit of saturation.
This might not have been they way you saw the area but I do like it a bit better. There is some noise in the new image but that, IMO, stems from starting with a small image that I copied from your post. If I started in RAW, I would have added some noise reduction from the start.
BTW: I really like the triangular formation of the groups of people and the canopy forming leading lines up to the tall temple.