data:image/s3,"s3://crabby-images/14103/14103bd221442a1e464634360a0fbb90af7c2540" alt="Two audio inputs one output"
If you're not working in real-time, things get much easier. For example, make sure that the sound card's input buffers never overflow, and that the output buffers never get empty. unless you write a kernel-mode driver, which I've never done, but I know it's not easy! The trick is to use buffers effectively.
data:image/s3,"s3://crabby-images/bf018/bf01845c9d750596238d0846e88dbb97c49e7584" alt="two audio inputs one output two audio inputs one output"
Your program NEVER has uninterruptable control of the CPU. REAL-TIME ISSUES - Digital processing in real-time is "interesting" with a multitasking operating system.
data:image/s3,"s3://crabby-images/54c3f/54c3f2dd6c86f8df430f881a3306c4d667290805" alt="two audio inputs one output two audio inputs one output"
Capturing the input from 2 sound cards simultaniously would most likely be much trickier than capturing 4 channels from one card (fewer multi-threading issues to deal with). If you use 2 standard sound cards, you will have to mix digitally. You "capture" the analog sound (either as 4 wave files, or as one 4-channel wave file), and digitally manipulate/mix it later.
data:image/s3,"s3://crabby-images/30a10/30a10086951c7cfb70ea06535d45a3b417cc5650" alt="two audio inputs one output two audio inputs one output"
The standard application for this type of card is as a 4-track digital recorder. From the compatability info, I'd guess their driver works seamlessly with Windows and the WinAPI. I took a quick look at your M-AUDIO link. (A sample in a 16 bit wave file will have a value between -3277.) All of the files would need to have the same sample rate, and you'd have to scale the levels, so that the summed result doesn't attempt to go over "zero dB". If you are not concerned about real-time processing, it would be easy to mix two (or more) wave files, and save the result to another file. But, I assume that Direct-X does have such a function. I'm pretty sure that there are no standard WinAPI functions for mixing sounds digitally. Now, everything above is related to mixing the analog signals in the sound card's hardware. All of the consumer cards are "soundblaster compatable". it might only work with the manufacturer's software. A special purpose multi-input sound card could be a problem, because the driver may not work with the standard Windows API.
data:image/s3,"s3://crabby-images/b5fee/b5fee67e6dbf14d6da6d5694aab6a80e6bcc3b90" alt="two audio inputs one output two audio inputs one output"
You'll need a special (professional) sound card if you need multiple line-level inputs. Most sound cards have only one auxiliary input. I think most of the Direct-X stuff is geared toward real-time processing. Windows has a file named MMSYSTEM.DLL, that has lots of multimedia functions. You can look-up mmsystem (Multi Media System). The WinAPI (Application Programming Interface) probably has most, if not all of the functions that you need.
data:image/s3,"s3://crabby-images/14103/14103bd221442a1e464634360a0fbb90af7c2540" alt="Two audio inputs one output"