There are plenty of questions on here related to fork() and exec(). I have not found one that really makes the process of using them simple though, and making programmer's lives simple is the goal.
I need a C++, linux-friendly function that does the following:
string RunCommand(string command, string input){}
This function should be able to run a shell command, like grep, and "pipe" the content of input into it and read the ouptut and return it. So if I would do the following at the command line:
ps -elf | grep somequerytext
I would in code do:
string psOutput = RunCommand("ps -elf","");
string grepOutput = RunCommand("grep somequerytext", psOutput);
*edit: The question is what is the best implementation of the RunCommand function.
*edit: popen was considered as a solution for simplicity, but popen restricts you to piping data in or piping data out, but not both.
Why not use
popen()
? It's in the standard library, and very simple to use:Before discussing the implementation of
RunCommand
, let us consider this code fragment:In the above code fragment, the problem is that the commands are run sequentially, and does not run concurrently/in parallel. (See Programming with POSIX threads p.9 ) To give an example if
ps -elf
generates huge amount of data, that will be stored inpsOutput
and then passed to next command. But in actual implementation, each process in the pipe are run concurrently and data is passed withpipe
(with some buffering of course) and there is no need to wait for the execution of one process before starting the execution of other process.I suggest you to look into the Richard Steven's Advanced Programming in the Unix Environment chapter.8 "Process Control" p.223 for an implementation of
system
. Based on Richard Steven's code, a sample implementation ofRunCommand
will be as follows (just skeleton code, no error checking):and then one would invoke the above functions as:
The shell takes care of parsing its input and running the commands by setting up
pipe
s in between them. If you are interested in understanding how a shell is implemented, see "A Minishell example" in Terrence Chan Unix System Programming using C++ chap.8 "Unix Processes" (Jonathan Leffler's answer pretty much describes a shell implementation!)It appears that you need a function to:
The only potential problem with this outline is if the child writes output before it is finished reading its input, and it writes so much output that the pipe is full (they have a finite and usually quite small capacity). In that case, the processes will deadlock - the parent trying to write to the child, and the child trying to write to the parent, and both stuck waiting for the other to read some data. You can avoid that by having two threads in the parent, one processing the writing, the other processing the reading. Or you can use two child processes, one to run the command and one to write to the standard input, while the parent reads from the command's standard output into a string.
One of the reasons there isn't a standard function to do this is precisely the difficulty of deciding what are the appropriate semantics.
I've ignored error handling and signal handling issues; they add to the complexity of it all.