You have a 24MPix photo.... ; this is the IMAGE SIZE, and it is NOT always directly related to the FILE SIZE its created from, in terms of the bytes of memory used to store the image data, on disc or in memory or anything..
Image Size: 6000 pixels by 4000 pixels; that defines the 'Hard-Frame'... simplistic analogy.... like, a card-board box, 10cm x 15cm x 20cm = 3000cm3 or 3 liters.
Now... what is in the box doesn't matter... the box will take up 3l of space regardless, wont it?
File Size: Lets get Play-School; We have 3.0l box. We we can put a single marble in it, or a brick... doesn't matter how 'big' of how 'heavy' what we put in it is.... the BOX will always be the same size, right?
OK.... so empty box.... full of air. Weighs a few grams. Lets half fill it with gravel; it now weighs a few Kilograms. BOX is the same SIZE, but its now a heck of a lot heavier.
Marbles and Bricks. We have a box, 20cm by 15cm by 10 cm. Its only just big enough to fit one house-brick in it, but you could probably fit 2oo marbles in there. How 'much' stuff you can get in the box depends on how it fits together and how much free-space is around the stuff you try and squash in, doesn't it?
If you try fitting in big lumps of gravel, chances are that you wont be able to pack as much 'weight' of gravel in the box, as if you have smaller gravel chippings.
So, 6000x4000 pixel IMAGE... how 'heavy' that 'box', how many maga-bytes of FILE size it takes up, is depends on what you fill it with.
OK.... so lets apply 'COMPRESSION'.
Our card-board box is a bit inconvenient, its inflexible restrictive shape, takes up space irrespective of what we stick in it, and we'd like to pack stuff down a bit.... lets use a flexible plastic bag instead. Same 3.0l capacity.. only now it can change shape a bit, and can work really well, if teh stuff you stick in it can change shape a bit too.
Unfortunately marbles and bricks are rather rigid and wont play ball so well, but, if we have soft squashy stuff, like cloths or duvets or grass-clippings, we can stuff the back choka-block, and then sit on it, and squash out all the air, to make it smaller.
Our nominal 3.0l container, might get stretched to get the stuff in it in the first place, but can be shrunk down quite a bit before we pack it away.
So the FILE SIZE, is dependent on the IMAGE SIZE then on the image content, and then on how much COMRESSION is or isn't appllied to it.
OK... Combining Images.
24Mega-Pizel IMAGE size. Fixed frame, lets ignore file size, and ignore compression. And you have TWO of them.
You open them up in a photo-editor and merge them into one NEW image.
How big that image is going to be STILL depends on the frame size you set for it.
You can merge pictures by Pixel ADDITION or you can merge pictures by Pixel SUBSTITUTION, or by a combination of both.
Pixel Substitution. You take Image 1; 6000x4000 pixels. You then take Image 2, and you cut either the whole image or a number of pixels from it as a crop, and then past them IN to Image 1. Image 1 has NOT changes size, and the only way to put those new pixels from the second picture into it, is to take away ones that were there to begin with.
EG: You have photo of Taj-Mahal, and you want to photo-shop Granny who'se never been anywhere more exotic than Bognor infront of it.
You take your 6000x4000 picture of the Taj-Mahal, you take your 6000x4000 picture of Granny, you then cut out the pixels defining Granny in that image, and past the OVER the ones in the picture of the Taj Mahal.
New image, within the frame of the original Taj picture is still 6000x4000 pixels, content has changed, though so even at the same compression level, there is likely to be a small 'weight' difference, due to the different content, BUT overall IMAGE size is the same, you have merged by substitution.
Pixel Addition: To merge two pictures, rather than swapping pixels from one to the other... you add them. Best example is a Panorama-Stitch.
You have looked at a wide vista; and unable to fit it all in your viewfinder at the same time, you take two pictures. One to left of center, one to right of center
Both are 6000x400 pixels image size, but because the content is different, chances are the file-size will be different to.
Line them up side by side like two post-cards, you get a new IMAGE size, 12,000 x 4000 pixels... probably with a horrible band where they join.
In thoery, exactly twice the pixels, and the self same pixels that were in the original images, the FILE size ought to be exactly the sum of the that for the two individual pictures. BUT all digital images, like the card-board box, have some invisible packaging. Card-board adds maybe 0.5mm to the overall size of the content; so a digital image, has some 'packaging data' adding to the file size; good example is the EXIF date and file tags that you can open up seperately to find out about the image.
Your two individual files have thier own packaging; open them up, put them both into one box, and well, chances are you might get away with a bit less packaging, and certainly you only need one 'lable' so its likely that the 'sum' of the two parts is actually a tad less than the sum of the individual parts... though could be more.
But lets look at that horrible join line between pictures:
STITCHING: Stitching, uses 'some' substitution and sum addition to make new image. Over-Lapping the individual 'frames' gives you something in the image to index on and line up one image over the other to over-lap, hopefully getting rid of that join line between the two, and ensuring that the new image flows seamlessly from one side of the new frame to the other.
Two 6000x4000 images; overlapping by what, 30% 2000 pixels; you get full width of first image 6000 pixels, then the 2/3 or 4000 pixels of the second that aren;t the same as the last third of the first. Put them together, you get a new image frame, 10,000x4000 pixels; part substitution, part addition.
SO.... the FILE SIZE of any image, is only loosely related to the IMAGE SIZE, by way of the pixel frame, the content density, and then applied compression.
When Merging Photos to make a new one; the 'new' file size, will again, still be dependent on the pixel frame, content denisty and applied compression.
If you dont increase the pixel frame size, (Substitution Merge) to include the new content; then it wont make a big difference to the file size, which will only be effected by change in content density and any applied compression.
If you increase the pixel frame to include new content (Addition Merge), then the file size is likely to be increased a lot more dramatically, and in loose proportion to the increase in pixel count. BUT still effected by content density and applied compression.
Analogy is, card-board boxes and plastic bags, bricks, gravel grass and duvets.
Things have different sizes, different weights and different volumes, and you are packaging them all up, in quite a complicated manner, where direct liniar relationships rarely follow simple logic.
BUT, Pixels and Bytes are like Liters and Kilograms; totally different commodities, and NOT directly proportional to one another.
One liter of water may weigh One kilogram, but one liter of air will doesn't, it's a lot lighter, and one might be squashed a lot more than the other.