1 files changed, 358 insertions, 0 deletions
diff --git a/rpkid.with_tls/doc/MyRPKI b/rpkid.with_tls/doc/MyRPKI
new file mode 100644
index 00000000..a3cfb6e0
--- /dev/null
+++ b/rpkid.with_tls/doc/MyRPKI
@@ -0,0 +1,358 @@
+****** The myrpki tool ******
+
+The design of rpkid and friends assumes that certain tasks can be thrown over
+the wall to the registry's back end operation.
+
+This was a deliberate design decision to allow rpkid et al to remain
+independent of existing database schema, business PKIs, and so forth that a
+registry might already have. All very nice, but it leaves someone who just
+wants to test the tools or who has no existing back end with a fairly large
+programming project. The myrpki tool attempts to fill that gap.
+
+myrpki is a basic implementation of what a registry back end would need to use
+rpkid and friends. myrpki does not use every available option in the other
+programs, nor is it necessarily as efficient as possible. Large registries will
+almost certainly want to roll their own tools, perhaps using these as a
+starting point. Nevertheless, we hope that myrpki will at least provide a
+useful example, and may be adaquate for simple use.
+
+myrpki is (currently) implemented as a single command line Python program. It
+has a number of commands, most of which are used for initial setup, some of
+which are used on an ongoing basis. myrpki can be run either in an interactive
+mode or by passing a single command on the command line when starting the
+program; the former mode is intended to be somewhat human-friendly, the latter
+mode is useful in scripting, cron jobs, and automated testing.
+
+myrpki use has two distinct phases: setup and data maintenance. The setup phase
+is primarily about constructing the "business PKI" (BPKI) certificates that the
+daemons use to authenticate CMS and HTTPS messages and obtaining the service
+URLs needed to configure the daemons. The data maintenance phase is about
+configuring local data into the daemons.
+
+myrpki uses the OpenSSL command line tool for almost all operations on keys and
+certificates; the one exception to this is the comamnd which talks directly to
+the daemons, as this command uses the same communication libraries as the
+daemons themselves do. The intent behind using the OpenSSL command line tool
+for everything else is to allow all the other commands to be run without
+requiring all the auxiliary packages upon which the daemons depend; this can be
+useful, eg, if one wants to run the back-end on a laptop while running the
+daemons on a server, in which case one might prefer not to have to install a
+bunch of unnecessary packages on the laptop.
+
+During setup phase myrpki generates and processes small XML messages which it
+expects the user to ship to and from its parents, children, etc via some out-
+of-band means (email, perhaps with PGP signatures, USB stick, we really don't
+care). During data maintenance phase, myrpki does something similar with
+another XML file, to allow hosting of RPKI services; in the degenerate case
+where an entity is just self-hosting (ie, is running the daemons for itself,
+and only for itself), this latter XML file need not be sent anywhere.
+
+The basic idea here is that a user who has resources maintains a set of .csv
+files containing a text representation of the data needed by the back-end,
+along with a configuration file containing other parameters. The intent is that
+these be very simple files that are easy to generate either by hand or as a
+dump from relational database, spreadsheet, awk script, whatever works in your
+environment. Given these files, the user then runs myrpki to extract the
+relevant information and encode everything about its back end state into an XML
+file, which can then be shipped to the appropriate other party.
+
+Many of the myrpki commands which process XML input write out a new XML file,
+either in place or as an entirely new file; in general, these files need to be
+sent back to the party that sent the original file. Think of all this as a very
+slow packet-based communication channel, where each XML file is a single
+packet. In setup phase, there's generally a single round-trip per setup
+conversation; in the data maintenance phase, the same XML file keeps bouncing
+back and forth between hosted entity and hosting entity.
+
+Note that, as certificates and CRLs have expiration and nextUpdate values, a
+low-level cycle of updates passing between resource holder and rpkid operator
+will be necessary as a part of steady state operation. [The current version of
+these tools does not yet regenerate these expiring objects, but fixing this
+will be a relatively minor matter.]
+
+The third important kind of file in this system is the configuration_file for
+myrpki. This contains a number of sections, some of which are for myrpki,
+others of which are for the OpenSSL command line tool, still others of which
+are for the various RPKI daemon programs. The examples/ subdirectory contains a
+commented version of the configuration file that explains the various
+parameters.
+
+The .csv files read by myrpki are (now) misnamed: formerly, they used the
+"excel-tab" format from the Python csv library, but early users kept trying to
+make the colums line up, which didn't do what the users expected. So now these
+files are just whitespace-delimted, such as a program like "awk" would
+understand.
+
+Keep reading, and don't panic.
+
+The default configuration file name for myrpki is myrpki.conf. You can change
+this using the "-c" option when invoking myrpki, or by setting the environment
+variable MYRPKI_CONF.
+
+See examples/*.csv for commented examples of the several CSV files. Note that
+the comments themselves are not legal CSV, they're just present to make it
+easier to understand the examples.
+
+***** myrpki overview *****
+
+Which process you need to follow depends on whether you are running rpkid
+yourself or will be hosted by somebody else. We call the first case "self-
+hosted", because the software treats running rpkid to handle resources that you
+yourself hold as if you are an rpkid operator who is hosting an entity that
+happens to be yourself.
+
+"$top" in the following refers to wherever you put the subvert-rpki.hactrn.net
+code. Once we have autoconf and "make install" targets, this will be some
+system directory or another; for now, it's wherever you checked out a copy of
+the code from the subversion repository or unpacked a tarball of the code.
+
+Most of the setup process looks the same for any resource holder, regardless of
+whether they are self-hosting or not. The differences come in the data
+maintenence phase.
+
+The steps needed during setup phase are:
+
+* Write a configuration file (copy $top/rpkid/examples/myrpki.conf and edit as
+  needed). You need to configure the [myrpki] section; in theory, the rest of
+  the file should be ok as it is, at least for simple use. You also need to
+  create (either by hand or by dumping from a database, spreadsheet, whatever)
+  the CSV files describing prefixes and ASNs you want to allocate to your
+  children and ROAs you want created.
+
+* Initialization ("initialize" command). This creates the local BPKI and other
+  data structures that can be constructed just based on local data such as the
+  config file. Other than some internal data structures, the main output of
+  this step is the "identity.xml" file, which is used as input to later stages.
+
+In theory it should be safe to run the "initialize" command more than once, in
+practice this has not (yet) been tested.
+
+* Send (email, USB stick, carrier pigeon) identity.xml to each of your parents.
+  This tells each of your parents what you call yourself, and supplies each
+  parent with a trust anchor for your resource-holding BPKI.
+
+* Each of your parents runs the "configure_child" command, giving the
+  identity.xml you supplied as input. This registers your data with the parent,
+  including BPKI cross-registration, and generates a return message containing
+  your parent's BPKI trust anchors, a service URL for contacting your parent
+  via the "up-down" protocol, and (usually) either an offer of publication
+  service (if your parent operates a repository) or a referral from your parent
+  to whatever publication service your parent does use. Referrals include a
+  CMS-signed authorization token that the repository operator can use to
+  determine that your parent has given you permission to home underneath your
+  parent in the publication tree.
+
+* Each of your parents sends (...) back the response XML file generated by the
+  "configure_child" command.
+
+* You feed the response message you just got into myrpki using the
+  "configure_parent" command. This registers the parent's information in your
+  database, including BPKI cross-certification, and processes the repository
+  offer or referral to generate a publication request message.
+
+* You send (...) the publication request message to the repository. The
+  contact_info element in the request message should (in theory) provide some
+  clue as to where you should send this.
+
+* The repository operator processes your request using myrpki's
+  "configure_publication_client" command. This registers your information,
+  including BPKI cross-certification, and generates a response message
+  containing the repository's BPKI trust anchor and service URL.
+
+* Repository operator sends (...) the publication confirmation message back to
+  you.
+
+* You process the publication confirmation message using myrpki's
+  "configure_repository" command.
+
+At this point you should, in theory, have established relationships, exchanged
+trust anchors, and obtained service URLs from all of your parents and
+repositories. The last setup step is establishing a relationship with your RPKI
+service host, if you're not self-hosted, but as this is really just the first
+message of an ongoing exchange with your host, it's handled by the data
+maintenance commands.
+
+The two commands used in data maintenence phase are "configure_resources" and
+"configure_daemons". The first is used by the resource holder, the second is
+used by the host. In the self-hosted case, it is not necessary to run
+"configure_resources" at all, myrpki will run it for you automatically.
+
+***** Hosted case *****
+
+The basic steps involved in getting started for a resource holder who is being
+hosted by somebody else are:
+
+* Run through steps listed in the_myrpki_overview_section.
+
+* Run the configure_resources command to generate myrpki.xml.
+
+* Send myrpki.xml to the rpkid operator who will be hosting you.
+
+* Wait for your rpkid operator to ship you back an updated XML file containing
+  a PKCS #10 certificate request for the BPKI signing context (BSC) created by
+  rpkid.
+
+* Run configure_resources again with the XML file you just received, to issue
+  the BSC certificate and update the XML file again to contain the newly issued
+  BSC certificate.
+
+* Send the updated XML file back to your rpkid operator.
+
+At this point you're done with initial setup. You will need to run
+configure_resources again whenever you make any changes to your configuration
+file or CSV files.
+
+  Warning:
+      Once myrpki knows how to update BPKI CRLs, you will also need to run
+      configure_resources periodically to keep your BPKI CRLs up to date.
+
+Any time you run configure_resources myrpki, you should send the updated XML
+file to your rpkid operator, who should send you a further updated XML file in
+response.
+
+***** Self-hosted case *****
+
+The first few steps involved in getting started for a self-hosted resource
+holder (that is, a resource holder that runs its own copy of rpkid) are the
+same as in the hosted_case above; after that the process diverges.
+
+The [current] steps are:
+
+* Follow the basic installation instructions in the_Installation_Guide to build
+  the RFC-3779-aware OpenSSL code and associated Python extension module.
+
+* Run through steps listed in the_myrpki_overview_section.
+
+* Set up the MySQL databases that rpkid et al will use. The package includes a
+  tool to do this for you, you can use that or do the job by hand. See MySQL
+  database_setup for details.
+
+* If you are running your own publication repository (that is, if you are
+  running pubd), you will also need to set up an rsyncd server or configure
+  your existing one to serve pubd's output. There's a sample configuration file
+  in $top/rpkid/examples/rsyncd.conf, but you may need to do something more
+  complicated if you are already running rsyncd for other purposes. See the
+  rsync(1) and rsyncd.conf(5) manual pages for more details.
+
+* Start the daemons. You can use $top/rpkid/start-servers.py to do this, or
+  write your own script. If you intend to run pubd, you should make sure that
+  the directory you specified as publication_base_directory exists and is
+  writable by the userid that will be running pubd, and should also make sure
+  to start rsyncd.
+
+* Run myrpki's configure_daemons command, twice, with no arguments. You need to
+  run the command twice because myrpki has to ask rpkid to create a keypair and
+  generate a certification request for the BSC. The first pass does this, the
+  second processes the certification request, issues the BSC, and loads the
+  result into rpkid. [Yes, we could automate this somehow, if necessary.]
+
+At this point, if everything went well, rpkid should be up, configured, and
+starting to obtain resource certificates from its parents, generate CRLs and
+manifests, and so forth. At this point you should go figure out how to use the
+relying party tool, rcynic: see $top/rcynic/README if you haven't already done
+so.
+
+If and when you change your CSV files, you should run configure_daemons again
+to feed the changes into the daemons.
+
+***** Hosting case *****
+
+If you are running rpkid not just for your own resources but also to host other
+resource holders (see hosted_case above), your setup will be almost the same as
+in the self-hosted case (see self-hosted_case, above), with one procedural
+change: you will need to tell configure_daemons to process the XML files
+produced by the resource holders you are hosting. You do this by specifying the
+names of all those XML files on as arguments to the configure_daemons command.
+So, if you are hosting two friends, Alice and Bob, then, everywhere the
+instructions for the self-hosted case say to run configure_daemons with no
+arguments, you will instead run it with the names of Alice's and Bob's XML
+files as arguments.
+
+Note that configure_daemons sometimes modifies these XML files, in which case
+it will write them back to the same filenames. While it is possible to figure
+out the set of circumstances in which this will happen (at present, only when
+myrpki has to ask rpkid to create a new BSC keypair and PKCS #10 certificate
+request), it may be easiest just to ship back an updated copy of the XML file
+after every you run configure_daemons.
+
+***** "Pure" hosting case *****
+
+In general we assume that anybody who bothers to run rpkid is also a resource
+holder, but the software does not insist on this.
+
+  Todo:
+      Er, well, rpkid doesn't, but myrpki now does -- "pure" hosting was an
+      unused feature that fell by the wayside while simplifying the user
+      interface. It would be relatively straightforward to add it back if we
+      ever need it for anything, but the mechanism it used to use no longer
+      exists -- the old [myirbe] section of the config file has been collapsed
+      into the [myrpki] section, so testing for existance of the [myrpki]
+      section no longer works. So we'll need an explicit configuration option,
+      no big deal, just not worth chasing now.
+
+A (perhaps) plausible use for this capability would be if you are an rpkid-
+running resource holder who wants for some reason to keep the resource-holding
+side of your operation completely separate from the rpkid-running side of your
+operation. This is essentially the pure-hosting model, just with an internal
+hosted entity within a different part of your own organization.
+
+***** Troubleshooting *****
+
+If you run into trouble setting up this package, the first thing to do is
+categorize the kind of trouble you are having. If you've gotten far enough to
+be running the daemons, check their log files. If you're seeing Python
+exceptions, read the error messages. If you're getting TLS errors, check to
+make sure that you're using all the right BPKI certificates and service contact
+URLs.
+
+TLS configuration errors are, unfortunately, notoriously difficult to debug,
+because connection failures due to misconfiguration happen early, deep in the
+guts of the OpenSSL TLS code, where there isn't enough application context
+available to provide useful error messages.
+
+If you've completed the steps above, everything appears to have gone OK, but
+nothing seems to be happening, the first thing to do is check the logs to
+confirm that nothing is actively broken. rpkid's log should include messages
+telling you when it starts and finishes its internal "cron" cycle. It can take
+several cron cycles for resources to work their way down from your parent into
+a full set of certificates and ROAs, so have a little patience. rpkid's log
+should also include messages showing every time it contacts its parent(s) or
+attempts to publish anything.
+
+rcynic in fully verbose mode provides a fairly detailed explanation of what
+it's doing and why objects that fail have failed.
+
+You can use rsync (sic) to examine the contents of a publication repository one
+directory at a time, without attempting validation, by running rsync with just
+the URI of the directory on its command line:
+
+     $ rsync rsync://rpki.example.org/where/ever/
+
+***** Known Issues *****
+
+The lxml package provides a Python interface to the Gnome libxml2 and libxslt C
+libraries. This code has been quite stable for several years, but initial
+testing with lxml compiled and linked against a newer version of libxml2 ran
+into problems (specifically, gratuitous RelaxNG schema validation failures).
+libxml2 2.7.3 worked; libxml2 2.7.5 did not work on the test machine in
+question. Reverting to libxml2 2.7.3 fixed the problem. Rewriting the two lines
+of Python code that were triggering the lxml bug appears to have solved the
+problem, so the code now works properly with libxml 2.7.5, but if you start
+seeing weird XML validation failures, it might be another variation of this
+lxml bug.
+
+An earlier version of this code ran into problems with what appears to be an
+implementation restriction in the the GNU linker ("ld") on 64-bit hardware,
+resulting in obscure build failures. The workaround for this required use of
+shared libraries and is somewhat less portable than the original code, but
+without it the code simply would not build in 64-bit environments with the GNU
+tools. The current workaround appears to behave properly, but the workaround
+requires that the pathname to the RFC-3779-aware OpenSSL shared libraries be
+built into the _POW.so Python extension module. At the moment, in the absence
+of "make install" targets for the Python code and libraries, this means the
+build directory; eventually, once we're using autoconf and installation
+targets, this will be the installation directory. If necessary, you can
+override this by setting the LD_LIBRARY_PATH environment variable, see the
+ld.so man page for details. This is a relatively minor variation on the usual
+build issues for shared libraries, it's just annoying because shared libraries
+should not be needed here and would not be if not for this GNU linker issue.