Understanding MongoDB's Default ObjectID

2024-04-02 19:04:59, 913 views, by 泡泡鱼

BSON ObjectID Specification

A BSON ObjectID is a 12-byte value consisting of a 4-byte timestamp (seconds since epoch), a 3-byte machine id, a 2-byte process id, and a 3-byte counter. Note that the timestamp and counter fields must be stored big endian, unlike the rest of BSON. This is because they are compared byte-by-byte and we want to ensure a mostly increasing order. The format:

byte:   0  1  2  3 | 4  5  6 | 7  8 | 9  10  11
field:  time       | machine | pid  | inc

Timestamp. This is a Unix-style timestamp. It is a signed int representing the number of seconds before or after January 1st 1970 (UTC).

Machine. This is the first three bytes of the (MD5) hash of the machine host name, or of the MAC/network address, or the virtual machine id.

Pid. This is 2 bytes of the process id (or thread id) of the process generating the object id.

Increment. This is an ever incrementing value, or a random number if a counter can't be used in the language/runtime.

BSON ObjectIds can be any 12 byte binary string that is unique; however, the server itself and almost all drivers use the format above.
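
To make the byte layout concrete, here is a minimal Node.js sketch that assembles the 24-character hex form of an ObjectId from the four fields. `buildObjectIdHex` is a hypothetical helper written for this illustration, not a driver API, and it assumes the machine and pid fields are also written big endian, which the specification above does not require.

```javascript
// Sketch only: packs the four fields into 12 bytes following the layout above.
// Assumption: machine and pid are written big endian too (the spec only
// mandates big endian for the timestamp and the counter).
function buildObjectIdHex(seconds, machine, pid, counter) {
  const buf = Buffer.alloc(12);
  buf.writeUInt32BE(seconds >>> 0, 0);        // bytes 0-3: timestamp
  buf.writeUIntBE(machine & 0xffffff, 4, 3);  // bytes 4-6: machine
  buf.writeUInt16BE(pid & 0xffff, 7);         // bytes 7-8: pid
  buf.writeUIntBE(counter & 0xffffff, 9, 3);  // bytes 9-11: counter
  return buf.toString("hex");
}

const earlier = buildObjectIdHex(1355199286, 0xffffff, 0xffff, 0xffffff);
const later = buildObjectIdHex(1355199287, 0, 0, 0);

// Because the timestamp comes first and is big endian, a plain
// byte-by-byte (here: string) comparison orders ids by creation second.
console.log(earlier, later, later > earlier);
```

The comparison at the end is the reason the specification insists on big endian for the timestamp: an id created one second later sorts higher even when every other field is smaller.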

The following commands inspect an ObjectId segment by segment, together with their results:

> db.test.findOne()._id.toString()  
ObjectId("50c6b336ba95d7738d1042e3")  
> db.test.findOne()._id.toString().substring(10,18)  
50c6b336  
> db.test.findOne()._id.toString().substring(18,24)  
ba95d7  
> db.test.findOne()._id.toString().substring(24,28)  
738d  
> db.test.findOne()._id.toString().substring(28,34)  
1042e3
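
The same split can be reproduced outside the shell on a bare 24-character hex string. The sketch below is an illustration under the layout described above; `splitObjectId` is a hypothetical helper name. Note that the offsets are 10 smaller than in the shell session, because the shell's `toString()` output starts with the 10-character prefix `ObjectId("`.

```javascript
// Sketch only: splits a 24-char hex ObjectId into the four fields
// of the layout described above (time, machine, pid, counter).
function splitObjectId(hex) {
  if (!/^[0-9a-fA-F]{24}$/.test(hex)) {
    throw new Error("expected 24 hex characters");
  }
  return {
    time: parseInt(hex.substring(0, 8), 16),       // seconds since epoch
    machine: hex.substring(8, 14),                 // 3 bytes, kept as hex
    pid: parseInt(hex.substring(14, 18), 16),      // 2 bytes
    counter: parseInt(hex.substring(18, 24), 16),  // 3 bytes
  };
}

const parts = splitObjectId("50c6b336ba95d7738d1042e3");
console.log(parts);
```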

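An ObjectId takes 12 bytes of storage and consists of a timestamp, a machine identifier, a PID, and a counter. Including the machine identifier avoids the performance bottleneck of a single central counter in a distributed environment. Including the PID allows several mongod instances to run on the same machine. Within a single process, the combination of timestamp and counter then guarantees uniqueness.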

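Timestamp

The uniqueness of an ObjectId depends on the ordering of time, not on its absolute value, so the clocks of the cluster nodes do not have to be perfectly synchronized. Since the ObjectId already carries a timestamp, a separate creation-time field in the document can often be omitted. When extracting the time from an ObjectID, keep in mind that MongoDB tolerates clock differences between nodes, so the value reflects the clock of whichever machine generated the id.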

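Two ways to read the timestamp are shown below. The second one has to use new Date(...) and multiply by 1000: a bare Date(...) call ignores its argument and returns the current time, and the Date constructor expects milliseconds rather than seconds.

> db.test1.findOne()._id.getTimestamp()  
ISODate("2012-12-12T03:52:45Z")  
> new Date(parseInt(db.test1.findOne()._id.toString().substring(10,18),16) * 1000)  
ISODate("2012-12-12T03:52:45Z")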
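
As a rough illustration of using the embedded timestamp in place of a separate date field, the following sketch converts in both directions on plain hex strings. Both function names are hypothetical; the second one produces the smallest id (under the layout above) for a given second, which could serve as a lower bound in a range comparison on `_id`.

```javascript
// Sketch only: the first 4 bytes (8 hex chars) are seconds since the epoch.
function objectIdToDate(hex) {
  const seconds = parseInt(hex.substring(0, 8), 16);
  return new Date(seconds * 1000); // Date expects milliseconds
}

// Smallest possible id for the given instant: timestamp followed by zero bytes.
function dateToObjectIdHex(date) {
  const seconds = Math.floor(date.getTime() / 1000);
  return seconds.toString(16).padStart(8, "0") + "0".repeat(16);
}

const created = objectIdToDate("50c6b336ba95d7738d1042e3");
console.log(created.toISOString());
console.log(dateToObjectIdHex(created));

// Pitfall: calling Date without `new` returns a string for "now"
// and ignores the argument entirely.
console.log(typeof Date(0));
```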

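Machine

The machine field is the first three bytes of the MD5 hash of the machine name, so collisions are possible; it does no harm to check for them when setting up a production cluster. I also noticed that the machine bytes changed after MongoDB was restarted, so be especially careful when trying to derive machine information from an ObjectID.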
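
To get a feel for the collision risk, here is a small Node.js sketch. It assumes the machine bytes are derived from the host name exactly as the specification describes (actual drivers may do something different), and it estimates the chance that at least two of n machines share the same 3-byte value with the standard birthday approximation. `machineBytes` and `collisionProbability` are hypothetical names.

```javascript
const crypto = require("crypto");
const os = require("os");

// First three bytes (6 hex chars) of the MD5 hash of a host name.
function machineBytes(hostname) {
  return crypto.createHash("md5").update(hostname).digest("hex").substring(0, 6);
}

// Birthday approximation for n machines drawing from 2^24 possible values:
// p ≈ 1 - exp(-n(n-1) / (2 * 2^24))
function collisionProbability(n) {
  return 1 - Math.exp(-(n * (n - 1)) / (2 * 2 ** 24));
}

console.log(machineBytes(os.hostname()));
for (const n of [10, 100, 1000, 5000]) {
  console.log(n, collisionProbability(n).toFixed(6));
}
```

For a cluster of a few dozen machines the probability stays small, but it grows roughly with the square of the machine count, which is why checking the actual values in production is cheap insurance.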

PID

It is enough to note that the PID usually changes every time the mongod process is restarted.

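Counter

The counter takes 3 bytes, so its maximum value is 256*256*256-1 = 16777215, which gives 16777216 distinct values per second for each process. One can therefore think of roughly ten million inserts per second on a single machine as the ceiling this design allows for MongoDB. By current standards, ten thousand inserts per second on a single machine is already very good, so the ObjectID counter is large enough.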
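
The arithmetic and the wrap-around behaviour of a 3-byte counter can be sketched as follows. `makeCounter` is a hypothetical helper for illustration, not how any particular driver implements it.

```javascript
// Largest value that fits in 3 bytes.
const COUNTER_MAX = 256 * 256 * 256 - 1; // 16777215, i.e. 0xFFFFFF

// Returns a function that yields successive counter values and
// wraps back to 0 after COUNTER_MAX by masking to 24 bits.
function makeCounter(start) {
  let value = start & COUNTER_MAX;
  return function next() {
    const current = value;
    value = (value + 1) & COUNTER_MAX;
    return current;
  };
}

const next = makeCounter(COUNTER_MAX - 1);
const sequence = [next(), next(), next()];
console.log(sequence); // [ 16777214, 16777215, 0 ]

// Headroom relative to 10,000 inserts per second.
console.log((COUNTER_MAX + 1) / 10000);
```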

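I inserted some records in a loop; in the queries below, b is the loop index. The results show that on my machine the ObjectId counter increases sequentially: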

> parseInt(db.test.findOne({b:1000})._id.toString().substring(28,34),16)  
1947382  
> parseInt(db.test.findOne({b:1001})._id.toString().substring(28,34),16)  
1947383  
> parseInt(db.test.findOne({b:1002})._id.toString().substring(28,34),16)  
1947384  
> parseInt(db.test.findOne({b:1003})._id.toString().substring(28,34),16)  
1947385

The following code comes from: http://www.cnblogs.com/xjk15082/archive/2011/09/18/2180792.html

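Constructing the ObjectId

public class ObjectId implements Comparable<ObjectId>, java.io.Serializable {
    final int _time;     // seconds since the Unix epoch
    final int _machine;  // machine piece combined with process piece (see _genmachine)
    final int _inc;      // counter value taken at construction time
    boolean _new;

    public ObjectId() {
        _time = (int) (System.currentTimeMillis() / 1000);
        _machine = _genmachine;
        _inc = _nextInc.getAndIncrement();
        _new = true;
    }
    // ……
}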

Generating the machine and process pieces

private static final int _genmachine;
static {
    try {
        // Machine piece: hash of all network interface descriptions,
        // shifted into the upper 16 bits.
        final int machinePiece;
        {
            StringBuilder sb = new StringBuilder();
            Enumeration<NetworkInterface> e = NetworkInterface.getNetworkInterfaces();
            while (e.hasMoreElements()) {
                NetworkInterface ni = e.nextElement();
                sb.append(ni.toString());
            }
            machinePiece = sb.toString().hashCode() << 16;
            LOGGER.fine("machine piece post: " + Integer.toHexString(machinePiece));
        }

        // Process piece: hash of the JVM runtime name and the class loader
        // identity, masked to the lower 16 bits.
        final int processPiece;
        {
            int processId = new java.util.Random().nextInt();
            try {
                processId = java.lang.management.ManagementFactory.getRuntimeMXBean().getName().hashCode();
            } catch (Throwable t) {
                // keep the random fallback value
            }
            ClassLoader loader = ObjectId.class.getClassLoader();
            int loaderId = loader != null ? System.identityHashCode(loader) : 0;
            StringBuilder sb = new StringBuilder();
            sb.append(Integer.toHexString(processId));
            sb.append(Integer.toHexString(loaderId));
            processPiece = sb.toString().hashCode() & 0xFFFF;
            LOGGER.fine("process piece: " + Integer.toHexString(processPiece));
        }

        _genmachine = machinePiece | processPiece;
        LOGGER.fine("machine : " + Integer.toHexString(_genmachine));
    } catch (java.io.IOException ioe) {
        throw new RuntimeException(ioe);
    }
}
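
The bit packing in the static block above can be tried out in isolation. The sketch below mirrors only the two lines `hashCode() << 16` and `hashCode() & 0xFFFF`; the input values are made-up stand-ins for the two hash codes, and `packMachine` and `toHex32` are hypothetical helpers. JavaScript bitwise operators work on 32-bit signed integers, so they behave like Java `int` here.

```javascript
// Sketch only: combine two 32-bit hashes the way the static block does.
function packMachine(machineHash, processHash) {
  const machinePiece = machineHash << 16;     // low 16 bits move to the top
  const processPiece = processHash & 0xffff;  // keep only the low 16 bits
  return machinePiece | processPiece;
}

// Render a signed 32-bit value as 8 unsigned hex digits.
function toHex32(n) {
  return (n >>> 0).toString(16).padStart(8, "0");
}

// Made-up hash values for illustration.
const packed = packMachine(0x1234abcd, 0x5678ef01);
console.log(toHex32(packed)); // abcdef01
```

The upper halves of both hashes (`1234` and `5678` here) are discarded, so the resulting 32-bit value carries 16 bits of machine information and 16 bits of process information.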
