I used to do this sort of shot many years ago, long before Photoshop and digital photography appeared. We relied on being in the right place at the right time and on a few tricks of the trade. However, because we were shooting transparencies on medium format, everything had to be done to capture the shot in one take.
As such, I am trying to figure out why you need loads of post-production in front of a computer screen. Are you trying to improve the sky? The golden hour and blue hour need planning for, but rarely need a sky cut and pasted in, as the light itself creates the pleasing colours and finish. If you do need to replace skies, this is a fairly straightforward task in Photoshop, and there are many YouTube videos if you are not sure how. But if all the skies are cut and pasted, there is a risk of the shots all ending up looking 'fake'. Are the photos so wide that a good UWA lens cannot accommodate all you want to see? Is this what you want to stitch together? Or is it a case of wanting to blend shots to avoid burn-out from the light in the windows? If it is the latter, just bracket a few shots in camera and stack them in post. Presumably it would be these you want to blend?